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Course Goals 


* Give a sense of how Vulkan is different from OpenGL 
* Show how to do basic drawing in Vulkan 


* Leave you with working, documented, understandable sample code 


http://cs.oregonstate.edu/~mjb/vulkan 
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Mike Bailey 


* Professor of Computer Science, Oregon State University 
* Has been in computer graphics for over 30 years 
* Has had over 8,000 students in his university classes 


* mjb@cs.oregonstate.edu 


Welcome! I'm happy 
to be here. I hope 
you are too ! 


= http://cs.oregonstate.edu/~mjb/vulkan 
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My Favorite Vulkan Reference 


CVuikan. 


Programming Guide 


Graham Sellers, Vulkan Programming Guide, 
Addison-Wesley, 2017. 
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Everything You Need to Know is Right Here ... Somewhere © 8 
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Top Three Reasons that Prompted the Development of Vulkan 9 


1. Performance 
2. Performance 


3. Performance 


Vulkan is better at keeping the GPU busy than OpenGL is. OpenGL drivers need to do a lot of CPU work 
before handing work off to the GPU. Vulkan lets you get more power from the GPU card you already have. 


This is especially important if you can hide the complexity of Vulkan from your customer base and just let 
them see the improved performance. Thus, Vulkan has had a lot of support and interest from game engine 
developers, 3" party software vendors, etc. 


As an aside, the Vulkan development effort was originally called “giNext’, which 
created the false impression that this was a replacement for OpenGL. It’s not. 


НІК 


ses BEYOND mjb — July 24, 2020 


22 SIGGRAPH 


THINK 
BEYOND 


OpenGL 4.2 Pipeline Overview 


OpenGL 4.2 Pipeline Flowchart 
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Who is the Khronos Group? 11 


The Khronos Group, Inc. is a non-profit member-funded industry consortium, focused on the 
creation of open standard, royalty-free application programming interfaces (APIs) for authoring and 
accelerated playback of dynamic media on a wide variety of platforms and devices. Khronos 
members may contribute to the development of Khronos API specifications, vote at various stages 
before public deployment, and accelerate delivery of their platforms and applications through early 
access to specification drafts and conformance tests. 
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Playing “Where’s Waldo” with Khronos Membership 
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Who’s Been Specifically Working on Vulkan? 13 
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Vulkan Differences from OpenGL 14 


* More low-level information must be provided (by you!) in the application, rather than the driver 
° Screen coordinate system is Y-down 


* No “current state", at least not one maintained by the driver 


* All of the things that we have talked about being deprecated in OpenGL are really 


deprecated in Vulkan: built-in pipeline transformations, begin-vertex*-end, fixed- 
function, etc. 


° You must manage your own transformations. 
* All transformation, color and texture functionality must be done in shaders. 


° Shaders are pre-"half-compiled" outside of your application. The compilation process is then 
finished during the runtime pipeline-building process. 
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Vulkan Highlights: Pipeline State Data Structure 15 


In OpenGL, your “pipeline state” is the combination of whatever your current graphics attributes 
are: color, transformations, textures, shaders, etc. 


Changing the state on-the-fly one item at-a-time is very expensive 

Vulkan forces you to set all your state variables at once into a “pipeline state object” (PSO) data 
structure and then invoke the entire PSO at once whenever you want to use that state 
combination 


Think of the pipeline state as being immutable. 


Potentially, you could have thousands of these pre-prepared pipeline state objects 
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Vulkan: Creating a Pipeline 16 


which stage (VERTEX, etc.) binding 
stride locati 
; ocation 
VkShaderModule аа binding 
format 
offset 


VkSpecializationInfo 


VkVertexInputBindingDescription 


VkVertexInputAttributeDescription 


VkPipelineShaderStageCreatelnfo 


VkPipelineVertexInputStateCreatelnfo 


Topology 


Shader stages 
VertexInput State 


VkPipelinelnputAssemblyStateCreatelnfo 


X, y, w, h, 

InputAssembly State Й 
Tesselation State VkViewportStateCreatelnfo Viewport minDepth, 
maxDepth 


Viewport State 
Rasterization State 
MultiSample State 


DepthStencil Stat 
piece. == VkPipelineDepthStencilStateCreatelnfo cullMode 


i polygonMode 
BEL frontFace 
RenderPass lineWidth 
basePipelineHandle 
basePipelinelndex 


Scissor 


VkPipelineRasterizationStateCreatelnfo 


depthTestEnable 
VkPipelineColorBlendStateCreatelnfo depthWriteEnable 


depthCompareOp 
VkGraphicsPipelineCreatelnfo stencilTestEnable 


stencilOpStateFront 
stencilOpStateBack 


VkPipelineColorBlendAttachmentState 


blendEnable 
srcColorBlendFactor 
dstColorBlendFactor 

colorBlendOp 
srcAlphaBlendFactor 
dstAlphaBlendFactor 
VkPipelineDynamicStateCreatelnfo alphaBlendOp 


colorWriteMask 


vkCreateGraphicsPipeline( ) 
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Querying the Number of Something 


uint32_t count; 
result = vkEnumeratePhysicalDevices( Instance, OUT &count, OUT (VkPhysicalDevice *)nullptr ); 


VkPhysicalDevice * physicalDevices = new VkPhysicalDevice[ count ]; 
result = vkEnumeratePhysicalDevices( Instance, OUT &count, OUT physicalDevices ); 


This way of querying information is a recurring OpenCL and Vulkan pattern (get used to it): 


How many total Where to 
there are put them 
result = vkEnumeratePhysicalDevices( Instance, &count, nullptr ); 


result = vkEnumeratePhysicalDevices( Instance, &count, physicalDevices ); 
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Vulkan Code has a Distinct “Style” of Setting Information in structs 18 
and then Passing that Information as a pointer-to-the-struct 


VkBufferCreatelnfo «О» 


vbci.sType = УК STRUCTURE,TYPE BUFFER CREATE INFO; 
vbci.pNext = nullptr; 

vbci.flags = 0; 

vbci.size = << buffer size in bytes >> 

vbci.usage = VK_USAGE_UNIFORM_B ER_BIT; 
vbci.sharingMode = VK_SHARING_MODE_BXCLUSIVE; 
vbci.queueFamilyIndexCount = 0; 

vbci.pQueueFamilyIndices = nullptr; 


VK RESULT result = vkCreateBuffer ( LogicalDevice, IN &vbci, PALLOCATOR, OUT &Buffer ); 


VkMemoryRequirements €> 


result = vkGetBufferMemoryRequirements( LogicalDevice, Buffer, OUT &vmr ); // fills vmr 


VkMemoryAllocatelnfo 
vmai.sType = VK STR ОКЕ XYPE MEMORY ALLOCATE INFO; 
vmai.pNext - nullptr; 
vmai.flags = 0; 
vmai.allocationSize = vmr.size; 
vmai.memoryTypelndex = 0; 


result = vkAllocateMemory( LogicalDevice, IN &vmai, PALLOCATOR&QUT &MatrixBufferMemoryHandle); 


result = vkBindBufferMemory( LogicalDevice, Buffer, MatrixBufferMemoryHandle, 0 ); 
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Vulkan 1.1 Reference Guide 


Vulkan? is a graphics and compute API consisting of procedures and 
functions to specify shader programs, compute kernels, objects, 
and operations involved in producing high-quality graphical images, 
specifically color images of three-dimensional objects. Vulkan is 
also a pipeline with programmable and state-driven fixed-function 
stages that are invoked by a set of specific drawing operations. 


Specification and additional resources at 
www.khronos.org/vulkan 


Return Codes [2.7.3] 


Return codes are reported via VkResult return values. 


ІК ERROR OUT ОҒ = [HOST, DEVICE] MEMORY 
Ma ERROR_{INITIALIZATION, MEMORY MAP] FAILED. 
VK ERROR. DEVICE. LOST 
VK ERROR, (EXTENSION, FEATURE, LAYER] NOT. PRESENT 
VK ERROR INOOMPATIBLE DRIVER 
VK ERROR TOO MANY OBIECTS. 
VK ERROR FORMAT NOT SUPPORTED. 
VK ERROR. FRAGMENTED, POOL 
VK ERROR OUT ОҒ POOL MEMORY 
VK ERROR INVALID EXTERNAL. HANDLE 
VKCERROR SURFACE LOST KHR 
VK ERROR NATIVE. WINDOW. IN, USE. KHR 
VKCERROR OUT OF DATE КНЕ 
VKCERROR INCOMPATIBLE DISPLAY. КНЕ. 


Devices and Queues [4] 


Physical Devices [4.1] 
VkResult vkEnumeratePhysicalDevices| 
Vkinstance instance, 


uint32_t* pPhysicalDeviceCount, 
VkPhysicalDevice® pPhysicalDevices); 
oid vkGetPhysicalDevic 

VP Physical Device physicalDer 

VkPhysicalDeviceProperties* pProperties]; [I 
void vkGetPhysicalDeviceProperties2( 

VkPhysicalDevice physicalDevice, 

VkPhysicalDeviceProperties2* pProperties); 
‘typedef struct VkPhysicalDeviceProperties? { 

ViStructureType sType; EXT} 

void 


ViPhySicalDeviceProperties properties; [ЕЕ 
] VkPhysicalDeviceProperties2; 


pNext must be NULL or point to one of: 


void vkGetPhysicalDeviceQueueFamilyProperties{ 
VkPhysicalDevice physicalDevice, 
vint32_t* pQueueFamilyPropertyCount, 
VkQueueFamilyPropertes* 
‘pQueveFamilyProperties); 
void vkGetPhysicalDeviceQueueFamilyProperties2( 
32 rente ropertyCo 
vint: unt, 
VkQueueFamilyProperties2* pQueueFamilyProperties); 
‘typedef struct VkQueueFamilyProperties { 
VkQueueFlags queveFlags; 
uint32_ Y i 
uint32_t timestompVolidBits; 
VkExtent3D minimogeTransferGronulority; 
]VkQueueFamilyProperties; 


queueFlags: 
VK. QUEUE. X ВІТ where X is GRAPHICS, COMPUTE, 
TRANSFER, PROTECTED, SPARSE. BINDING 


‘typedef struct VkQueueFamilyProperties2 { 
WiStructurelype Type; Ст) 
m iefamilyProperties 

ah iei 
JWkasevefemiyiroperuesa: 


Vulkan Quick Reference Card – | Recommend you Print This! 


Vulkan. KHRONOS 


Color coded names as follows: function names and structure names 
[n.n.n] Indicates sections and text in the Vulkan API 1.1 Specification. 
ЕСІ indicates a page in this reference guide for more information. 


Ef indicates reserved for future use. 
pNext must either be NULL, or point to a valid structure which extends the base 
structure according to the valid usage rules of the base structure. 


Command Function Pointers and Instances [3] 


Command Function Pointers [3.1] 
РЕМ vkVoidFunction vkGetInstanceProcAddr( 
Vkinstance instance, const char* T 
PFN_vkVoidFunction vkGetDeviceProcAddi 
VkDevice device, const char* pues 
PFN_vkVoidFunction is: 


‘typedef void|VKAPI_PTR® PFN. vkVoidFunction)(void]; 


Instances [3.2] 
VkResult vkEnumeratelnstanceVersion{ 
uint32_t* pApiVersion); 
VkResult vkCreatelnstance{ 
const ‘VkinstanceCreatelnfo* pCreateinfo, 
const VkAllocationCallbacks* pAllocator, CEI 
Vnstance* instance); 
struct VkinstanceCreatelnfo { 
Puce sType; СТЕ 
const void* pNext; 
VkinstanceCreateFiags flogs; ЕГІ 
const VkApplicationinfo* pApplicationinfo; 
uint32 t enabledLayerCount; 
const char* const* nie pene 
uint32_t enabledExtensionCount; 


typedef: struct VkPhyscalDesiceGroupProperis { 
ViStruczure Type sType; La 


icalDevice physicalDevices| 
VK MAX DEVICE GROUP SIZE]; 
VkBool32 subsetAliocation; 

] VkPhysicalDeviceGroupProperties; 


Device Creation [4.2.1] 
"date pee 
hysicalDevice 
st VkDeviceCreatelnfo* crea atelnfo, 
= VkAllocationCallbacks* pAllocator, 
VkDevice* pDevice); 


plex must be NULL or point to one of: 
VkDeviceGroupDeviceCreateinfo 
VkPhysicalDevice16BitStorageFeatures [XT] 
VkPhysicalDeviceFeatures2 LIT] 
VkPhysicalDeviceMultiviewFeatures [XT] 
VkPhysicalDeviceProtectedMemoryFeatures (22) 
VkPhysicalDeviceSamplerlcberConversionFeatures 
VPhysicalDeviceVariablePointerFeatures. 


түреде! struct ViericeGroupDeviceCreatenfo t 
ViStructureType sType; 
седе Baie 


ео сое 
]VkDeviceGroupDeviceCreatelnfo; 


Device Destruction р А] 


const VidllocatonCalbacks* pAllocator); C} 


const char* const* ppEnabledExtensionNames; 
] VkinstanceCreatelnfo; 


түреде! struct VkApplicationinfo | 
ViStructureType sType; ГТ 


uint32_t apiVersion; 
) VkApplicationinto; 


void vkDestroyinstance( 
Vkinstance instance, 
const VkAllocationCallbacks* pAllocator}; EXT 


Queues dis 2 
ее Туре; GIO) 
const void* pNext; 
Па flags; 
uint32_t queueFamilyindex; 
uint32 t queueCount; 
const fioat* pQueuePriorities; 
] VkDeviceQueueCreatelnfo; 
‘flags: VK_DEVICE_QUEUE_CREATE_PROTECTED_BIT 
void vkGetDeviceQueue(VkDevice device, 
uint32_t queueFamilyindex, uint32_t queuelndex, 
VkQueue* pQueue]; 
void vkGetDeviceQueue2(VkDevice device, 
const VkDeviceQueuelnfo2* pQueueinfo, 
VkQueue* pQueue]; 


typedef struct VkDeviceQueuelnfoZ | 
\VkStructureType sType; СЕ 
const void* pNext; 
VkDeviceQueueCreateFiags flags; 
uint32_t queueFamilylndex, uintà2, 1 queuelndex; 
1VkDeviceQueuelnfo2; 
Лодз: VK DEVICE QUEUE CREATE PROTECTED ВІТ 


Command Buffers [5] 
Also see Command Buffer Lifecyde diagram. 


const Vitommanapooicestnto* go 


uint32_t qi 
} vkCommandPoolCreatelnto, Ө 
flags: VK COMMAND POOL CREATE X BIT where Xis 
PROTECTED, RESET COMMAND BUFFER, TRANSIENT 
void vkTrimCommandPool(VkDevice device, 
VkCommandPool commandPool, 
VkCommandPoolTrimFlags 1005); ET 
VkResult vkResetCommandPool{ 
WkDevice device, VkCommandPool commandPool, 
VkCommandPoolResetFlags flags); 
flags: VK COMMAND POOL RESET. RELEASE - 
RESOURCES. 


void vkDestroyCommandPool 
VkDevice device, VkCommandPool commandPoo!, 


https://www.khronos.org/files/vulkan11-reference-guide.pdf 
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Vulkan Quick Reference Card 20 


Vulkan 1.1 Reference Guide 


Vulkan Pipeline Diagram [9] 


Some Vulkan commands specify geometric objects 
to be drawn or computational work to be performed, 
while others specify state controlling how objects 
are handled by the various pipeline stages, or control 
data transfer between memory organized as images 
and buffers. Commands are effectively sent through 
a processing pipeline, either a graphics pipeline or a 
compute pipeline. 

E Fixed Function Stage 

Г Shader Заре 


| Storage Images 


https://www.khronos.org/files/vulkan11-reference-guide.pdf 
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Vulkan Highlights: Overall Block Diagram 
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Vulkan Highlights: a More Typical Block Diagram e 
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Steps in Creating Graphics using Vulkan 23 


Create the Vulkan Instance 

Setup the Debug Callbacks 

Create the Surface 

List the Physical Devices 

Pick the right Physical Device 
Create the Logical Device 

Create the Uniform Variable Buffers 
Create the Vertex Data Buffers 
Create the texture sampler 

10. Create the texture images 

11. Create the Swap Chain 

12. Create the Depth and Stencil Images 
13. Create the RenderPass 

14. Create the Framebuffer(s) 

15. Create the Descriptor Set Pool 

16. Create the Command Buffer Pool 
17. Create the Command Buffer(s) 

18. Read the shaders 

19. Create the Descriptor Set Layouts 
20. Create and populate the Descriptor Sets 
21. Create the Graphics Pipeline(s) 

22. Update-Render-Update-Render- ... 


Se IO ОЕ № = 


H THINK 


ses BEYOND mjb — July 24, 2020 


24 


Vulkan. 


The Vulkan Sample Code Included with These Notes 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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Sample Program Output 25 


8 ' Vulkan Sample x 


THINK 4 
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Sample Program Keyboard Inputs 


T: 
'm', "М": 
'р’, 'P': 
'q', 'Q': 
Esc: 


'т', 'R': 


1. ‘4’, “О? 


Toggle lighting off and on 

Toggle display mode (textures vs. colors, for now) 
Pause the animation 

quit the program 

quit the program 


Toggle rotation-animation and using the mouse 


Toggle using a vertex buffer only vs. an index buffer 
(in the index buffer version) 


Set the number of instances 
(in the instancing version) 
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Caveats on the Sample Code, | 27 


1. Гуе written everything out in appalling longhand. 


2. Everything is in one .cpp file (except the geometry data). It really should be 
broken up, but this way you can find everything easily. 


3. At times, | could have hidden complexity, but | didn't. At all stages, | have tried 
to err on the side of showing you everything, so that nothing happens in a way 
that’s kept a secret from you. 


4. I’ve setup Vulkan structs every time they are used, even though, in many cases 
(most?), they could have been setup once and then re-used each time. 


5. At times, I’ve setup things that didn't need to be setup just to show you what 
could go there. 
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Caveats on the Sample Code, Il 28 


6. There are great uses for C++ classes and methods here to hide some complexity, but 
Гуе not done that. 


7. Гуе typedef'ed a couple things to make the Vulkan phraseology more consistent. 


8. Even though it is not good software style, | have put persistent information in global 
variables, rather than a separate data structure. | hope it is clearer this way. 


9. At times, | have copied lines from vulkan.h into the code as comments to show you 
what certain options could be. 


10.l've divided functionality up into the pieces that make sense to me. Many other 
divisions are possible. Feel free to invent your own. 
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Main Program 29 


int 
main( int argc, char * argv[ ] ) 


Width = 800; 
Height = 600; 


errno_t err = fopen_s( &FpDebug, DEBUGFILE, "w" ); 
і err != 0) 
( 


fprintf( stderr, "Cannot open debug print file '%s'\n", DEBUGFILE ); 
FpDebug = stderr; 


} 
fprintf(FpDebug, "FpDebug: Width = %d ; Height = %d\n", Width, Height); 


Reset( ); 
InitGraphics( ); 


Il loop until the user closes the window: 
while( glfwWindowShouldClose( MainWindow ) == 0 ) 
{ 


glfwPollEvents( ); 

Time = glfwGetTime( ); II elapsed time, in double-precision seconds 
UpdateScene( ); 

RenderScene( ); 


) 


fprintf(FpDebug, "Closing the GLFW window\n"); 


vkQueueWaitldle( Queue ); 
vkDeviceWaitldle( LogicalDevice ); 
DestroyAllVulkan( ); 
glfwDestroyWindow( MainWindow ); 
glfwTerminate( ); 

return 0; 
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Vulkan Conventions 
VkXxx is a typedef, probably a struct 


vkYyy( ) is a function call 
VK ZZZ is a constant 


My Conventions 


“Init” in a function call name means that something is being setup that only needs to be setup 
once 


The number after “Init” gives you the ordering 


In the source code, after main( ) comes InitGraphics( ), then all of the InitxxYYY( ) functions in 
numerical order. After that comes the helper functions 


“Find” in a function call name means that something is being looked for 
“ЕШ” in a function call name means that some data is being supplied to Vulkan 


"IN" and "OUT" ahead of function call arguments are just there to let you know how an 
argument is going to be used by the function. Otherwise, IN and OUT have no significance. 


O ЕШ They are actually Zdefine'd to nothing. 
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Your Sample2019.zip File Contains This 


Linux shader compiler 


Windows shader compiler 


Double-click here to 
launch Visual Studio 2019 
with this solution 


phare View 


Р This PC > п (\\guille\bailey\users) (Y:) > Vulkan > Sample2019 > 


A 


Name 


ER 

n Debug 

[] gim 

| | glm.0.9.8.5 
| | glm-0.9.9-a2 
Ú ERRORS. pptx 
С frag.spv 
glfw3.h 

BRR gifw3.lib 


| glslangValidator 
1-| glslangValidator.exe 


|_| glslangValidator.help 
2 Makefile 

i] puppy.bmp 
puppy.jpg 

| puppy0.bmp 
puppy0.jpg 

++ sample.cpp 


+! Sample.vcxproj 

E Sample.vexprojfilters 
Fal Sample.vcxproj.user 
= sample08.pdf 
=È sample09.pdf 
=È sample10.pdf 


| sample-comp.comp 


ў sample-comp.spv 


LI 
kd 
A sample-frag.frag 


Date modified 


8/2016 5:06 AM 
{31/2017 5:24 PM 
)17 12:33 PM 
6/2017 2:31 PM 
2018 11:41 AM 


/10/2018 8:13 AM 


File folder 
File folder 
File folder 
File folder 
File folder 


Microsoft PowerP... 


SPV File 
C/C++ Header 
Object File Library 
File 
Application 
HELP File 

File 

BMP File 

JPG File 

BMP File 

JPG File 

C++ Source 


C++ Source 


Microsoft Visual S... 


VC++ Project 


VC++ Project Filte... 
Per-User Project O... 
Adobe Acrobat D... 
Adobe Acrobat D... 


Adobe Acrobat D... 


COMP File 
SPV File 
FRAG File 


The “19” refers to the version of Visual Studio, not the year of development. 
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Reporting Error Results, | 32 


struct errorcode 
VkResult resultCode; 
std::stringmeaning; 

ErrorCodes[ ] = 

( 
(VK NOT READY, "Not Ready" y, 
{ VK_TIMEOUT, "Timeout* }, 
{ VK_EVENT_SET, "Event Set“ }, 
{ VK_EVENT_RESET, "Event Reset“ y, 
{ VK_INCOMPLETE, "Incomplete“ y, 
{ VK_ERROR_OUT_OF_HOST_MEMORY, "Out of Host Memory“ }, 
{ VK_ERROR_OUT_OF_DEVICE_MEMORY, "Out of Device Memory“ }, 
{ VK_ERROR_INITIALIZATION_FAILED, "Initialization Failed“ y, 
{ VK_ERROR_DEVICE_LOST, "Device Lost" y, 
(VK ERROR MEMORY MAP FAILED, "Memory Map Failed" }, 
(VK ERROR LAYER NOT PRESENT, "Layer Not Present" y, 
{ VK_ERROR_EXTENSION_NOT_PRESENT, "Extension Not Present" }, 
{ VK_ERROR_FEATURE_NOT_PRESENT, "Feature Not Present" y, 
{ VK_ERROR_INCOMPATIBLE_DRIVER, "Incompatible Driver" y, 
{ VK_ERROR_TOO_MANY_OBJECTS, "Too Many Objects“ y; 
(VK ERROR FORMAT NOT SUPPORTED, "Format Not Supported" }, 
(VK ERROR FRAGMENTED POOL, "Fragmented Pool" y, 
(VK ERROR SURFACE LOST KHR, "Surface Lost" y, 
(VK ERROR NATIVE WINDOW IN USE KHR, "Native Window in Use“ }, 
(VK SUBOPTIMAL KHR, "Suboptimal" }, 
(VK ERROR OUT ОҒ DATE КНЕ, "Error Out of Date" y, 
{ VK_ERROR_INCOMPATIBLE_DISPLAY_KHR, "Incompatible Display“ y, 
{ VK_ERROR_VALIDATION_FAILED_ EXT, "Valuidation Failed“ y, 
(VK ERROR INVALID SHADER NV, "Invalid Shader“ y, 
(VK ERROR OUT OF POOL MEMORY KHR, "Out of Pool Memory" È 
{ VK_ERROR_INVALID_EXTERNAL_HANDLE, "Invalid External Handle“ }, 
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Reporting Error Results, Il 33 


void 
PrintVkError( VkResult result, std::string prefix ) 


{ 
if (Verbose && result == VK_SUCCESS) 
{ 


fprintf(FpDebug, "%s: %s\n", prefix.c_str(), "Successful"); 
fflush(FpDebug); 
return; 


} 


const int numErrorCodes = sizeof( ErrorCodes ) / sizeof( struct errorcode ); 
std::string meaning = ""; 
for( int i = 0; i < numErrorCodes; i++ ) 


{ 

ІК result == ErrorCodes[i].resultCode ) 
meaning = ErrorCodes[i].meaning; 
break; 

} 

} 


fprintf( FpDebug, "\n%s: %s\n", prefix.c_str(), meaning.c str() ); 
fflush(FpDebug); 
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Extras in the Code 34 


#define REPORT(s) { PrintVkError( result, s ); fflush(FpDebug); } 

define HERE | АМ(5) ІК Verbose ) { fprintf( FpDebug, "***** 95s *****\n", s ); fflush(FpDebug); ) 
bool Paused; 

bool Verbose; 

define DEBUGFILE "VulkanDebug.txt" 

errno terr = fopen s( &FpDebug, DEBUGFILE, "w" ); 

const int32 t OFFSET ZERO = 0; 
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Vulkan. 


Drawing 
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mjb@cs.oregonstate.edu 
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Vulkan Topologies 


VK. PRIMITIVE. TOPOLOGY. POINT. LIST VK PRIMITIVE TOPOLOGY TRIANGLE LIST 


e V4 V 
©. 
ө 


Vo ө Yi Vo 


VK_PRIMITIVE_TOPOLOGY_TRIANGLE_STRIP 


VK_PRIMITIVE_TOPOLOGY_LINE_LIST 


VK_PRIMITIVE_TOPOLOGY_TRIANGLE_FAN 


V3 
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Vulkan Topologies 


typedef enum VkPrimitiveTopology 
{ 


VK_PRIMITIVE_TOPOLOGY_POINT_LIST 
VK_PRIMITIVE_TOPOLOGY_LINE_LIST 
VK_PRIMITIVE_TOPOLOGY_LINE_STRIP 
VK_PRIMITIVE_TOPOLOGY_TRIANGLE_LIST 
VK_PRIMITIVE_TOPOLOGY_TRIANGLE_STRIP 

VK PRIMITIVE TOPOLOGY TRIANGLE FAN 

VK PRIMITIVE TOPOLOGY LINE LIST WITH ADJACENCY 

VK PRIMITIVE TOPOLOGY LINE STRIP. WITH ADJACENCY 
VK PRIMITIVE TOPOLOGY TRIANGLE LIST WITH ADJACENCY 
VK PRIMITIVE TOPOLOGY TRIANGLE STRIP WITH ADJACENCY 
VK PRIMITIVE TOPOLOGY PATCH LIST 
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A Colored Cube Example 
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static GLuint CubeTriangleIndicesi[ ][3] 
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Triangles Represented as an Array of Structures 39 


From the file SampleVertexData.cpp: 


struct vertex 
{ 
glm::vec3 position; 
glm::vec3 normal; 
glm::vec3 color; 
glm::vec2 texCoord; 
}; 
struct vertex VertexData[ ]= 
{ 
II triangle 0-2-3: 
II vertex #0: 
{ 
{ -1., -1., -1. }, 
{ 0., 0., -1. }, 
( 0., 0., 0. }, 
{ 1.,0.} 
}, 
II vertex #2: 
5 dida Modeled in 
( 0, 0,,-1.}, right-handed 
Ls Tu He coordinates 
( 1.1.) 
) 
II vertex #3: 
( 
{ 1., 1., -1.}, 
{ 0., 0., -1.}, 
{ 1., 1, 0. }, 
( 0.,1.} 
}, 


THINK 
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Non-indexed Buffer Drawing 40 


From the fite SampleVertexData.cpp: 


struct vertex 
{ 
glm::vec3 position; Vertex 7 
glm::vec3 normal; 
glm::vec3 color; 
glm::vec2 texCoord; 


Stream of Vertices 


}; 


struct vertex VertexData[ ] = 


( 


ІІ vertex #0: 


{ -1., -1., -1. }, 
( 0., 0., -1. }, 
5 0., 0., 0. }, 
{ 1.,0.} 

}, 

II vertex #2: 

{ 


{ -1., 5 
( 0., 0., 
{ 0., 1., 0. }, 
21.1.) 
| 1 
II vertex #3: Triangles 
( 
471.) 
# ™ s. 
ao 
, 


THINK 
im H BEYOND 


mjb — July 24, 2020 


Filling the Vertex Buffer 41 


struct vertex VertexData[ ] = 


{ 
|? 


MyBuffer MyVertexDataBuffer; 


Init05MyVertexDataBuffer( sizeof(VertexData), OUT &MyVertexDataBuffer ); 
FillobDataBuffer( MyVertexDataBuffer, (void *) VertexData ); 


VkResult 
Init05MyVertexDataBuffer( IN VkDeviceSize size, OUT MyBuffer * pMyBuffer ) 


VkResult result; 
result = InitobDataBuffer( size, УК BUFFER USAGE VERTEX BUFFER ВІТ, pMyBuffer ); 
return result; 


- m 
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A Preview of What Init05DataBuffer Does 42 


VkResult 
Init05DataBuffer( VkDeviceSize size, VkBufferUsageFlags usage, OUT MyBuffer * pMyBuffer ) 


VkResult result = VK CCESS; 
VkBufferCreateln 
vbci.sType = УК” UCTURE_TYPE_BUFFER_CREATE_INFO; 
vbci.pNext = nullptr; 
vbci.flags = 0; 
vbci.size = pMyBuffer->si2e 
vbci.usage = usage; 
vbci.sharingMode = VK_SHARING_MODE_EXCLUSIVE; 
vbci.queueFamilyIndexCount = 0; 
vbci.pQueueFamilyIndices = (const 941132 t *)nullptr; 
result = vkCreateBuffer ( LogicalDevice, IN &vbci, PALLOCATOR, OUT &pMyBuffer->buffer ); 


VkMemoryRequirements Ж» 
vkGetBufferMemoryRequirements(/LogicalDevice, IN pMyBuffer->buffer, ry Il fills vmr 


VkMemoryAllocatelnfo 
vmai.sType = VK_STRUC 
vmai.pNext = nullptr; 


VkDeviceMemory 
result = vkAllocateMemo 
pMyBuffer->vdm = vdm; 


result = vkBindBufferMemory( LogicalDevice, pMyBuffer->buffer, IN vdm, 0 ); Il 0 is the offset 
return result; 


m 
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Telling the Pipeline about its Input 43 


We will come to the Pipeline later, but for now, know that a Vulkan pipeline is essentially a very large 
data structure that holds (what OpenGL would call) the state, including how to parse its input. 


C/C++: 


GLSL Shader: 


layout( location = 0 ) in vec3 aVertex; 
layout( location = 1 ) іп vec3 aNormal; 


struct vertex 


glm:: 
glm::vec3 
glm::vec3 
glm:: 


position; 


layout( location = 2 ) in vec3 aColor; 
layout( location = 3 ) in vec2 aTexCoord; 


VkVertexInputBindingDescription vvibd[1]; // one of these per buffer data buffer 
vvibd[0].binding = 0; Il which binding # this is 

vvibd[O].stride = sizeof( struct vertex ); Il bytes between successive structs 
vvibd[0].inputRate = VK VERTEX INPUT RATE VERTEX; 
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Telling the Pipeline about its Input 44 


struct vertex 


{ 


layout( location = 0 ) in vec3 aVertex; 


glm::vec3 position; layout( location = 1 ) in vec3 aNormal; 


glm::vec3 normal; 
glm::vec3 color; 
glm::vec2 texCoord; 


layout( location = 2 ) in vec3 aColor; 
layout( location = 3 ) in vec2 aTexCoord; 


y 
VkVertexInputAttributeDescription vviad[4]; Il array per vertex input attribute 
Il 4 = vertex, normal, color, texture coord 
vviad[0].location = 0; Il location in the layout decoration 
vviad[0].binding = 0; Il which binding description this is part of 
vviad[0].format = VK_FORMAT_VEC3; Il x, у, 2 
vviad[0].offset = offsetof( struct vertex, position ); Ше 


Always use the C/C++ 


vviad[1].location = 1; 

vviad[1].binding = 0; 

vviad[1].format = VK_FORMAT_VEC3; Il nx, ny, nz 
vviad[1].offset = offsetof( struct vertex, normal ); I| 12 


construct offsetof, rather than 
hardcoding the value! 


vviad[2].location = 2; 

vviad[2].binding = 0; 

vviad[2].format = V&K FORMAT VEC3; Ит, g, b 
vviad[2].offset = offsetof( struct vertex, color ); Il 24 


vviad[3].location = 3; 

vviad[3].binding = 0; 

vviad[3].format = VK_FORMAT_VEC2; I| s,t 

vviad[3].offset = offsetof( struct vertex, texCoord ); Il 36 
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Telling the Pipeline about its Input 45 


We will come to the Pipeline later, but for now, know that a Vulkan Pipeline is essentially a very large 
data structure that holds (what OpenGL would call) the state, including how to parse its vertex input. 


VkPipelineVertexInputStateCreatelnfo vpvisci; Il used to describe the input vertex attributes 
vpvisci.sType = VK STRUCTURE TYPE PIPELINE VERTEX INPUT STATE CREATE INFO; 
vpvisci.pNext = nullptr; 
vpvisci.flags 7 0; 
vpvisci.vertexBindingDescriptionCount = 1; 
vpvisci.pVertexBindingDescriptions = vvibd; 
vpvisci.vertexAttributeDescriptionCount = 4; 
vpvisci.pVertexAttributeDescriptions = vviad; 


VkPipelinelnputAssemblyStateCreatelnfo vpiasci; 
vpiasci.sType = МК STRUCTURE TYPE PIPELINE INPUT ASSEMBLY STATE CREATE INFO; 
vpiasci.pNext = nullptr; 
vpiasci.flags = 0; 
vpiasci.topology = УК PRIMITIVE TOPOLOGY TRIANGLE LIST;; 
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Telling the Pipeline about its Input 46 


We will come to the Pipeline later, but for now, know that a Vulkan Pipeline is essentially a very large 
data structure that holds (what OpenGL would call) the state, including how to parse its vertex input. 


VkGraphicsPipelineCreatelnfo 
vgpci.sType = УК STRUCTURE ТҮР 
vgpci.pNext = nullptr; 
vgpci.flags = 0; 
vgpci.stageCount = 2; Il number of sh 
vgpci.pStages = vpssci; 
vgpci.pVertexInputState = &vpvisci; 
vgpci.pInputAssemblyState = &vpiasci; 
vgpci.p TessellationState = (VkPipelineTessellationStat&Createlnfo *)nullptr; Il &vptsci 
vgpci.pViewportState = &vpvsci; 
vgpci.pRasterizationState = &vprsci; 
vgpci.pMultisampleState = &vpmsci; 
vgpci.pDepthStencilState = &vpdssci; 
vgpci.pColorBlendState = &vpcbsci; 
vgpci.pDynamicState = &vpdsci; 
vgpci.layout = IN GraphicsPipelineLayout; 
vgpci.renderPass = IN RenderPass; 
vgpci.subpass = 0; // subpass number 
vgpci.basePipelineHandle = (VkPipeline) VK NULL HANDLE; 
vgpci.basePipelinelndex = 0; 


RAPHICS PIPELINE CREATE INFO; 


der stages in this pipeline 


result = vkCreateGraphicsPipelines( LogicalDevice, VK NULL HANDLE, 1, IN &vgpci, 
PALLOCATOR, OUT &GraphicsPipeline ); 
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Telling the Command Buffer what Vertices to Draw 47 


We will come to Command Buffers later, but for now, know that you will specify the vertex buffer 
that you want drawn. 


VkBuffer buffers[1] = MyVertexDataBuffer. buffer; 


vkCmdBindVertexBuffers( CommandBuffers[nextlmagelIndex], 0, 1, vertexDataBuffers, offsets ); 


const uint32 ( 

const uint32 a Always use the C/C++ 
const uint32_t firstVertex = 0; construct sizeof, rather than 
const uint32_t firstInstance = 0; hardcoding a count! 


vkCmdDraw( CommandBuffers[nextlmagelndex], vertexCount, instanceCount, firstVertex, firstInstance ); 


THINK 
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Drawing with an Index Buffer 48 


struct vertex JustVertexData[ ] = 
{ Stream of Vertices Stream of Indices 


ІІ vertex #0: 

{ {-1.,-1.,-1.}, Vertex 7 
{ 0., 0., -1. }, 
{ 0., 0., 0.}, Vertex 5 
{ 1.,,0.} 

}, 

Il vertex #1: 

{ 
{ 1., -1., -1. 
( 0., 0., -1. Vertex 3 
{ 1., 0., 0. 
{ 0.0.) 

}, 

int JustIndexData[ 
{ 

0, 2, 3, 

0, 3, 1, 

4,5, 7, 

4,7,6, 

1,3,7, 

1, 7, 5, Triangles 

0,4, 6, 

0,6,2, 

TE П 

2,7,3, 

0, 5, 4, 
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Drawing with an Index Buffer 49 


vkCmdBindVertexBuffers( commandBuffer, firstBinding, bindingCount, vertexDataBuffers, vertexOffsets ); 


vkCmdBindlndexBuffer( commandBuffer, indexDataBuffer, indexOffset, indexType ); 


typedef enum VklndexType 


VK INDEX TYPE (ІМТ16-0,//0- 65,535 
VK INDEX TYPE UINT32 = 1, // 0 — 4,294,967,295 
) VkIndexType; 


vkCmdDrawIndexed( commandBuffer, indexCount, instanceCount, firstlndex, vertexOffset, firstlnstance); 


THINK 
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Drawing with an Index Buffer 50 


VkResult 
Init05MyIndexDataBuffer(IN VkDeviceSize size, OUT MyBuffer * pMyBuffer) 


VkResult result = Init05DataBuffer(size, УК BUFFER USAGE INDEX BUFFER ВІТ, pMyBuffer); 
// fills pMyBuffer 
return result; 


Init05MyVertexDataBuffer( sizeof(JustVertexData), IN &MyJustVertexDataBuffer ); 
FillobDataBuffer( MyJustVertexDataBuffer, (void *) JustVertexData ); 


Init05MyIndexDataBuffer( sizeof(JustIndexData), IN &MyJustIndexDataBuffer ); 
FillobDataBuffer( MyJustIndexDataBuffer, (void *) JustIndexData ); 
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Drawing with an Index Buffer 51 


VkBuffer vBuffers[1] = { MyJustVertexDataBuffer.buffer }; 
VkBuffer iBuffer = ( MyJustIndexDataBuffer.buffer }; 


vkCmdBindVertexBuffers( CommandBuffers[nextlmagelndex], 0, 1, vBuffers, offsets ); 
Il 0, 1 = firstBinding, bindingCount 
vkCmaBindIndexBuffer( CommandBuffers[nextlmagelndex], iBuffer, 0, УК INDEX TYPE UINT32 ); 


const uint32 t vertexCount = sizeof( JustVertexData ) / sizeof( JustVertexData[0] ); 
const uint32 t indexCount = sizeof( JustlndexData ) / sizeof( JustlndexData[0] ); 
const uint32 t instanceCount = 1; 

const uint32 t firstVertex = 0; 

const uint32 t firstlndex = 0; 

const uint32 t firstlnstance = 0; 

const uint32 t vertexOffset = 0; 


vkCmdDrawIndexed( CommandBuffers[nextlmagelndex], indexCount, instanceCount, firstIndex, 
vertexOffset, firstlnstance ); 
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Sometimes the Same Point Needs Multiple Attributes 52 


Sometimes a point that is common tọø#fíultiple faces has the same attributes, no matter what 


face itis in. Sometimes it doesn't. 


A color-interpolated cube like this actually has both. Point #7 above has the same color, 
regardless of what face it is in. However, Point #7 has 3 different normal vectors, depending 
on which face you are defining. Same with its texture coordinates. 


Thus, when using indexed buffer drawing, you need to create a new vertex struct if any 
of (position, normal, color, texCoords} changes from what was previously-stored at 
those coordinates. 
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Sometimes the Same Point Needs Multiple Attributes 53 


Where values do not match at the 
corners (texture coordinates) 


Where values match at the 
corners (color) 
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Vulkan. 


Shaders and SPIR-V 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 


x THINK 


mjb — July 24, 2020 


55 
The Shaders’ View of the Basic 


Computer Graphics Pipeline 


* In general, you want to have a vertex and 
fragment shader as a minimum. 


“А missing stage is ОК. The output from one 
stage becomes the input of the next stage 
that is there. 


“Тһе last stage before the fragment shader 
feeds its output variables into the rasterizer. 
The interpolated values then go to the 
fragment shaders 


= Fixed Function 


| | = Programmable 
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56 
Vulkan Shader Stages 


Shader stages 


typedef enum VkPipelineStageFlagBits { 
VK PIPELINE STAGE TOP OF РІРЕ BIT = 0x00000001, 
VK PIPELINE STAGE DRAW INDIRECT BIT = 0x00000002, 
VK PIPELINE STAGE VERTEX INPUT BIT = 0x00000004, 
K PIPELINE STAGE VERTEX SHADER BIT - 0x00000008, 
K PIPELINE STAGE TESSELLATION CONTROL SHADER BIT = 0x00000010, 
K PIPELINE STAGE TESSELLATION EVALUATION SHADER BIT = 0x00000020, 
K PIPELINE STAGE GEOMETRY SHADER BIT = 0x00000040, 
VK PIPELINE STAGE FRAGMENT SHADER BIT - 0x00000080, 
VK PIPELINE STAGE EARLY FRAGMENT TESTS BIT = 0x00000100, 
VK PIPELINE STAGE LATE FRAGMENT TESTS BIT = 0x00000200, 
VK PIPELINE STAGE COLOR ATTACHMENT OUTPUT BIT = 0x00000400, 
VK PIPELINE STAGE COMPUTE SHADER BIT = 0x00000800, 
VK PIPELINE STAGE TRANSFER BIT = 0x00001000, 
VK PIPELINE STAGE BOTTOM OF PIPE BIT = 0x00002000, 
VK PIPELINE STAGE HOST BIT = 0x00004000, 
VK PIPELINE STAGE ALL GRAPHICS BIT = 0x00008000, 
VK PIPELINE STAGE ALL COMMANDS BIT = 0x00010000, 
) VkPipelineStageFlagBits; 


H ru 
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How Vulkan GLSL Differs from OpenGL GLSL 


Detecting that a GLSL Shader is being used with Vulkan/SPIR-V: 


* In the compiler, there is an automatic 
define VULKAN 100 


Vulkan Vertex and Instance indices: OpenGL uses: 
gl VertexIndex gl VertexID 
gl Instancelndex gl InstancelD 


* Both are 0-based 


gl FragColor: 


* |n OpenGL, gl FragColor broadcasts to all color attachments 
* |n Vulkan, it just broadcasts to color attachment location #0 


* Best idea: don't use it at all — explicitly declare out variables to have specific location numbers 


H ru 
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How Vulkan GLSL Differs from OpenGL GLSL 58 


Shader combinations of separate texture data and samplers: 
uniform sampler s; 


uniform texture2D t; Note: our sample code 
vec4 rgba = texture( sampler2D( t, s ), Т ); doesn't use this. 


Descriptor Sets: 
layout( set=0, binding=0 )... ; 


Push Constants: 
layout( push constant )... ; 


Specialization Constants: 
layout( constant id = 3) const int N = 5; 


* Only for scalars, but a vector's components can be constructed from specialization constants 


Specialization Constants for Compute Shaders: 
layout( local size x id = 8, local size y id = 16); 


* This sets gl WorkGroupSize.x and gl WorkGroupSize.y 
* gl WorkGroupSize.z is set as a constant 
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Vulkan: Shaders’ use of Layouts for Uniform Variables 59 


Il non-sampler variables must be in a uniform block: - - 
layout( 514140, set = 0, binding = 0 ) uniform matBuf All non-sampler uniform variables 
must be in block buffers 


mat4 uModelMatrix; 
mat4 uViewMatrix; 
mat4 uProjectionMatrix; 
mat3 uNormalMatrix; 

} Matrices; 


Il non-sampler variables must be in amiform block: 
layout( 514140, set = 1, binding =) uniform lightBuf 


{ 
vec4 uLightPos; 
} Light; 


layout( set = 2, binding = 0 ) uniform sampler2D uTexUnit; 


shaderModuleCreateFlags coner] (ü int32 0 


codeSize (in bytes) 


VkShaderModuleCreatelnfo( ) 


device 
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225 


ол + Q № — 


TEN 


. Software vendors don't need to ship their shader source 

. Syntax errors appear during the SPIR-V step, not during runtime 

. Software can launch faster because half of the compilation has already taken place 
. This guarantees a common front-end syntax 

. This allows for other language front-ends 


Vulkan Shader Compiling 60 


* You half-precompile your shaders with an external compiler 


* Your shaders get turned into an intermediate form known as SPIR-V, which stands 
for Standard Portable Intermediate Representation. 


* SPIR-V gets turned into fully-compiled code at runtime, when the pipeline 
structure is finally created 


* The SPIR-V spec has been public for a few years -new shader languages аге 
surely being developed 


* OpenGL and OpenCL have now adopted SPIR-V as well 


pide Compiler in " 
GLSL Source ---> GLSL — , SPRV — Vendor-specific 
з соде 
Compiler 


- Run Time 
Develop Time Advantages: 


mjb — July 24, 2020 


SPIR-V: 61 
Standard Portable Intermediate Representation for Vulkan 


glslangValidator shaderFile -V [-H] [-l<dir>] [-S <stage>] -o shaderBinaryFile.spv 


Shaderfile extensions: 

егі Vertex 

.tesc Tessellation Control 

.tese Tessellation Evaluation 
.geom Geometry 

.frag Fragment 

.comp Compute 

(Can be overridden by the —S option) 


-V Compile for Vulkan 

-G Compile for OpenGL 

-| Directory(ies) to look in for #includes 

-S Specify stage rather than get it from shaderfile extension 
-C Print out the maximum sizes of various properties 


Windows: glslangValidator.exe 
Linux: glslangValidator 


a THINK 
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Running glslangValidator.exe 62 


gislangValidator.exe 


sample-vert.ve 


-о sample-vert.spv 


k. 


Compile for Vulkan (“-G” is compile for OpenGL) Specify the output file 


The input file. The compiler determines the shader type by the file extension: 


.vert Vertex shader 

.tccs Tessellation Control Shader 
.tecs Tessellation Evaluation Shader 
.geom Geometry shader 

frag Fragment shader 

.comp Compute shader 


72 SIGGRAPH THINK 


өз BEYOND 


mjb — July 24, 2020 


Running glslangValidator.exe 


sumpic-wvert.ver 


$ 186 


sample-trag.Trag 


MINGW64 /y/Vulkan/S5ample2017 
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How do you know if SPIR-V compiled successfully? 64 


Same as C/C++ -- the compiler gives you no nasty messages. 
Also, if you care, legal .spv files have a magic number of 0x07230203 


So, if you do an od —x on the .spv file, the magic number looks like this: 
0203 0723... 
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Reading a SPIR-V File into a Vulkan Shader Module 65 


#define SPIRV_MAGIC 0x07230203 
VkResult 
Init12SpirvShader( std::string filename, VkShaderModule * pShaderModule ) 
{ 
FILE *fp; 
(void) fopen s( &fp, filename.c str(), "rb"); 
if( fp == NULL ) 


fprintf( FoDebug, "Cannot open shader file '%s'\n", filename.c str( ) ); 
return V&K SHOULD EXIT; 


uint32 t magic; 
fread( &magic, 4, 1, fp ); 
if( magic != SPIRV MAGIC ) 


fprintf( FpDebug, "Magic number for spir-v file '%s is Ox%08x -- should be 0х%08х\п", 
filename.c str( ), magic, SPIRV MAGIC ); 
return VK SHOULD EXIT; 
} 


fseek( fp, OL, SEEK_END ); 

int size = ftell( fp ); 

rewind( fp ); 

unsigned char *code = new unsigned char [size]; 
fread( code, size, 1, fp ); 

fclose( fp ); 


THINK 


BEYOND mjb — July 24, 2020 


Reading a SPIR-V File into a Shader Module 66 


VkShaderModule ShaderModuleVertex; 


VkShaderModuleCreatelnfo 
vsmci.sType = УК STRUCTURE _ 
vsmci.pNext - nullptr; 
vsmci.flags = 0; 
vsmci.codeSize = size; 
vsmci.pCode = (uint32 t *)code; 


SHADER MODULE CREATE INFO; 


VkResult result = vkCreateShaderModule( LogicalDevice, &vsmci, PALLOCATOR, OUT & ShaderModuleVertex ); 
fprintf( FoDebug, "Shader Module '%s' successfully loaded", filename.c str() ); 

delete [ | code; 

return result; 
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Vulkan: Creating a Pipeline 67 


which stage (VERTEX, etc.) binding 
stride locati 
— ; ocation 
VkShaderModule | BEE binding 
- format 
offset 


VkSpecializationInfo 


VkVertexInputBindingDescription 


VkVertexInputAttributeDescription 


VkPipelineShaderStageCreatelnfo 


VkPipelineVertexInputStateCreatelnfo 


VkPipelinelnputAssemblyStateCreatelnfo 
X, y, w, h, 
VkViewportStateCreatelnfo minDepth, 
Rasterization State 


maxDepth 
"Hm — Scissor 
MultiSample State VkPipelineRasterizationStateCreatelnfo 


р == VkPipelineDepthStencilStateCreatelnfo cullMode 
olygonMode 
Dynamic State po Y9 
Pipeline layout 
RenderPass 
basePipelineHandle 
basePipelinelndex 


Topology 


Shader stages 
Vertexlnput State 
InputAssembly State 
Tesselation State 
Viewport State 


frontFace 
lineWidth 


depthTestEnable 
VkPipelineColorBlendStateCreatelnfo depthWriteEnable 


depthCompareOp 
VkGraphicsPipelineCreatelnfð stencilTestEnable 
stencilOpStateFront 
stencilOpStateBack 


VkPipelineColorBlendAttachmentState 


blendEnable 
srcColorBlendFactor 
dstColorBlendFactor 

colorBlendOp 
srcAlphaBlendFactor 
dstAlphaBlendFactor 
VkPipelineDynamicStateCreatelnfo alphaBlendOp 


colorWriteMask 


vkCreateGraphicsPipeline( ) 


m 
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You can also take a look at SPIR-V Assembly 68 


glslangValidator.exe м CH) sample-vert.vert -о sample-vert.spv 


7 


This prints out the SPIR-V “assembly” to standard output. 
Other than nerd interest, there is no graphics-programming reason to look at this. © 


- m 
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For example, if this is your Shader Source 69 


#version 400 

#extension GL_ARB_separate_shader_objects : enable 
#extension GL_ARB_shading_language_420pack : enable 
layout( std140, set = 0, binding = 0 ) uniform matBuf 

{ 


mat4 uModelMatrix; 
mat4 uViewMatrix; 
mat4 uProjectionMatrix; 
mat3 uNormalMatrix; 

} Matrices; 


Il non-opaque must be in a uniform block: 
layout( std140, set = 1, binding = 0 ) uniform lightBuf 


{ 
уес4 uLightPos; 
} Light; 


layout( location = 0 ) in vec3 aVertex; 
layout( location = 1 ) in vec3 aNormal; 
layout( location = 2 ) in vec3 aColor; 
layout( location = 3 ) in vec2 aTexCoord; 


layout ( location = 0 ) out vec3 vNormal; 
layout ( location = 1 ) out vec3 vColor; 
layout ( location = 2 ) out vec2 vTexCoord; 


void 
main( ) 


{ 


mat4 PVM = Matrices.uProjectionMatrix * Matrices.uViewMatrix * Matrices.uModelMatrix; 
gl_Position = PVM * vec4( aVertex, 1. ); 


vNormal = Matrices.uNormalMatrix * aNormal; 
vColor = aColor; 
vTexCoord = aTexCoord; 


© SIGGRAPH ТИ 
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This is the SPIR-V Assembly, Part | 


#version 400 
#extensionGL_ARB_separate_shader_objects : enable 
#extension GL_ARB_shading_language_420pack : enable 
layout( std140, set 0, binding = 0 ) uniform matBuf 

{ 


matá uModelMatrix: 
mat4 uViewMatrix: 
maté uProjectionMatrix; 
mat3 uNormalMatrix; 

} Matrices; 


11 non-opaque must be in a uniform block: 
layout( std140, set = 1, binding = O ) uniform lightBuf 


t 
vec4 uLightPos; 
} Light; 


layout( location = 0 ) in vec3 aVertex; 
layout( location = 1 ) in vec3 aNormal; 
layout( location = 2 ) in vec3 аСоюг; 
layout( location = 3 ) in vec2 aTexCoord; 


layout ( location = 0 ) out vec3 vNormal; 
layout ( location = 1 ) out vec3 vColor; 
layout ( location - 2 ) out vec2 vTexCoord; 


void 
main( ) 
{ 


mat4 PVM = Matrices.uProjectionMatrix * Matrices.uViewMatrix * Matrices.uModelMatrix; 
gl Position = PVM * vec4( aVertex, 1. ): 


vNormal = Matrices.uNormalMatrix * aNormal: 
vColor = aColor; 
vTexCoord = aTexCoord; 


THINK 


BEYOND 


Capability Shader 

Extinstimport "GLSL.std.450" 

MemoryModel Logical GLSL450 

EntryPoint Vertex 4 "main" 34 37 48 53 56 57 61 63 
Source GLSL 400 

SourceExtension "GL ARB separate shader objects" 


SourceExtension "GL ARB shading language 420pack" 
Name 4 "main" 
Name 10 "PVM" 
Name 13 "matBuf" 
MemberName 13(matBuf, 
MemberName 13(matBuf 
MemberName 13(matBuf 
MemberName 13(matBuf 
Name 15 "Matrices" 
Name 32 "gl PerVertex" 
MemberName 32(gl PerVertex) O "gl Position" 
MemberName 32(gl PerVertex) 1 "gl PointSize" 
MemberName 32(gl PerVertex) 2 "gl ClipDistance" 
Name 34 "" 
Name 37 "aVertex" 
Name 48 "vNormal" 
Name 53 "aNormal" 
Name 56 "vColor" 
Name 57 "aColor" 
Name 61 "vTexCoord" 
Name 63 "aTexCoord" 
Name 65 "lightBuf" 
MemberName 65(lightBuf) 0 "uLightPos" 
Name 67 "Light" 
MemberDecorate 13(matBuf) 0 ColMajor 
MemberDecorate 13(matBuf) 0 Offset 0 
MemberDecorate 13(matBuf) 0 MatrixStride 16 

( 

( 


0 "uModelMatrix" 

1 "uViewMatrix" 

2 "uProjectionMatrix" 
3 "uNormalMatrix" 


еа е» ы | 


MemberDecorate 13(matBuf) 1 ColMajor 
MemberDecorate 13(matBuf) 1 Offset 64 
MemberDecorate 13(matBuf) 1 MatrixStride 16 
MemberDecorate 13(matBuf) 2 ColMajor 
MemberDecorate 13(matBuf) 2 Offset 128 
MemberDecorate 13(matBuf) 2 MatrixStride 16 
MemberDecorate 13(matBuf) 3 ColMajor 
MemberDecorate 13(matBuf) 3 Offset 192 
MemberDecorate 13(matBuf) 3 MatrixStride 16 
Decorate 13(matBuf) Block 

Decorate 15(Matrices) DescriptorSet 0 


Шы 
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This is the SPIR-V Assembly, Part Il 


#version 400 

fextension GL ARB separate shader objects : enable 
#extension GL ARB shading language 420pack: enable 
layout( std140, set 0, binding = 0 ) uniform matBuf 

{ 


matá uModelMatrix: 
mat4 uViewMatrix: 
maté uProjectionMatrix; 
mat3 uNormalMatrix; 

} Matrices; 


11 non-opaque must be in a uniform block: 
layout( std140, set = 1, binding = O ) uniform lightBuf 
t 
vec4 uLightPos; 
} Light; 


layout( location = 0 ) in vec3 aVertex; 
layout( location = 1 ) in vec3 aNormal; 
layout( location = 2 ) in vec3 аСоюг; 
layout( location = 3) in vec2 aTexCoord; 


layout ( location = 0 ) out vec3 vNormal; 
layout ( location = 1 ) out vec3 vColor; 
layout ( location = 2 ) out vec2 vTexCoord; 


void 
main( ) 
{ 


mat4 PVM = Matrices.uProjectionMatrix * Matrices.uViewMatrix * Matrices.uModelMatrix; 
gl Position = PVM * vec4( aVertex, 1. ): 


vNormal = Matrices.uNormalMatrix * aNormal: 
vColor = aColor; 
vTexCoord = aTexCoord; 


THINK 


BEYOND 


32(gl PerVertex): 


Decorate 15(Matrices) Binding 0 
MemberDecorate 32(gl PerVertex) 0 BuiltIn Position 
MemberDecorate 32(gl PerVertex) 1 Вит PointSize 
MemberDecorate 32(gl PerVertex) 2 BuiltIn ClipDistance 
Decorate 32(gl PerVertex) Block 
Decorate 37(aVertex) Location 0 
Decorate 48(vNormal) Location 0 
Decorate 53(aNormal) Location 1 
Decorate 56(vColor) Location 1 
Decorate 57(aColor) Location 2 
Decorate 61(vTexCoord) Location 2 
Decorate 63(aTexCoord) Location 3 
MemberDecorate 65(lightBuf) 0 Offset 0 
Decorate 65(lightBuf) Block 
Decorate 67(Light) DescriptorSet 1 
Decorate 67(Light) Binding 0 
TypeVoid 
TypeFunction 2 
TypeFloat 32 
TypeVector 6(float) 4 
TypeMatrix 7(fvec4) 4 
TypePointer Function 8 
TypeVector 6(float) 3 
TypeMatrix 11(fvec3) 3 
TypeStruct 8 8 8 12 
TypePointer Uniform 13(matBuf) 
14(ptr) Variable Uniform 
Typelnt 32 1 
16(int) Constant 2 
TypePointer Uniform 8 
16(int) Constant 1 
16(int) Constant 0 
Typelnt 32 0 
29(int) Constant 1 
TypeArray 6(float) 30 
TypeStruct 7(fvec4) 6(float) 31 
TypePointer Output 32(gl PerVertex) 
33(ptr) Variable Output 
TypePointer Input 11(fvec3) 


37(aVertex): 36(ріг) Variable Input 


39: 6(float) Constant 1065353216 
TypePointer Output 7(fvec4) 
TypePointer Output 11(fvec3) 


48(vNormal): 47(рїг) Variable Output 


49:  16(int) Constant 3 
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This is the SPIR-V Assembly, Part Ill 


#version 400 
#extensionGL_ARB_separate_shader_objects : enable 
#extension GL_ARB_shading_language_420pack : enable 
layout( std140, set 0, binding = 0 ) uniform matBuf 

{ 


matá uModelMatrix: 
mat4 uViewMatrix: 
maté uProjectionMatrix; 
mat3 uNormalMatrix; 

} Matrices; 


11 non-opaque must be in a uniform block: 
layout( std140, set = 1, binding = O ) uniform lightBuf 


t 
vec4 uLightPos; 
} Light; 


layout( location = 0 ) in vec3 aVertex; 
layout( location = 1 ) in vec3 aNormal; 
layout( location = 2 ) in vec3 аСоюг; 
layout( location = 3) in vec2 aTexCoord; 


layout ( location = 0 ) out vec3 vNormal; 
layout ( location = 1 ) out vec3 vColor; 
layout ( location = 2 ) out vec2 vTexCoord; 


void 
main( ) 
{ 


gl Position = PVM * vec4( aVertex, 1. ): 
vNormal = Matrices.uNormalMatrix * aNormal: 


vColor = aColor; 
vTexCoord = aTexCoord; 


THINK 
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mat4 PVM = Matrices.uProjectionMatrix * Matrices.uViewMatrix * Matrices.uModelMatrix; 


50: 


TypePointer Uniform 12 


53(aNormal): 36(ріг) Variable Input 
56(vColor): 47(ptr) Variable Output 
57(aColor) 36(ptr) Variable Input 


59: TypeVector 6(float) 2 
60: TypePointer Output 59(fvec2) 
61(vTexCoord): 60(ріг) Variable Output 
62: TypePointer Input 59(fvec2) 
63(aTexCoord): 62(ріг) Variable Input 
65(lightBuf): TypeStruct 7(fvec4) 
66: TypePointer Uniform 65(lightBuf) 
67(Light): 66(ріг) Variable Uniform 
4(main): 2 Function None 3 
5: Label 
10(PVM):  9(ptr) Variable Function 
19:  18(ptr) AccessChain 15(Matrices) 17 
20: 8 Load 19 
22:  18(ptr) AccessChain 15(Matrices) 21 
23: 8 Load 22 
24: 8 MatrixTimesMatrix 20 23 
26:  18(ptr) AccessChain 15(Matrices) 25 
27 8 Load 26 
28 8 MatrixTimesMatrix 24 27 
Store 10(PVM) 28 
35: 8 Load 10(PVM) 
38: 11(fvec3) Load 37(aVertex) 
40: 6(float) CompositeExtract 38 0 
41: 6(float) CompositeExtract 38 1 
42: 6(float) CompositeExtract 38 2 
43: Т(Мес4) CompositeConstruct 40 41 42 39 
44: T(fvec4) MatrixTimesVector 35 43 
46: 45(ріг) AccessChain 34 25 
Store 46 44 
51: 50(ріг) AccessChain 15(Matrices) 49 
52: 12 Load 51 
54: 11(fvec3) Load 53(aNormal) 
55: 11(fvec3) MatrixTimesVector 52 54 
Store 48(vNormal) 55 
58: 11(fvec3) Load 57(aColor) 
Store 56(vColor) 58 
64: 59(fvec2) Load 63(aTexCoord) 


Store 61(vTexCoord) 64 
Return 
FunctionEnd 
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A Google-Wrapped Version of glslangValidator 73 


Тһе shaderc project from Google (htips://github.com/google/shaderc) provides 
a glslangValidator wrapper program called glslc that has a much improved 
command-line interface. You use, basically, the same way: 


gisic.exe -target-env-vulkan sample-vert.vert -o sample-vert.spv 
There are several really nice features. The two | really like are: 
1. You can #include files into your shader source 


2. You can “fdefine” definitions on the command line like this: 
gisic.exe --target-env=vulkan -DNUMPONTS=4 sample-vert.vert -o sample-vert.spv 


glsic is included in your Sample .zip file 
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Vulkan. 


Data Buffers 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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From the Quick Reference Card 75 


Vulkan 1.1 Reference Guide 


Vulkan Pipeline Diagram [9] 


essellation Control Shader 


Some Vulkan commands specify geometric objects 
to be drawn or computational work to be performed, 
while others specify state controlling how objects 
are handled by the various pipeline stages, or control 
data transfer between memory organized as images 
and buffers. Commands are effectively sent through 
a processing pipeline, either a graphics pipeline or a 
compute pipeline. 


E] fixed Function Stage 
L shader Stage 
E13 Storage Images 
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Terminology Issues 76 


A Vulkan Data Buffer is just a group of contiguous bytes in GPU memory. They have no 
inherent meaning. The data that is stored there is whatever you want it to be. (This is 
sometimes called a “Binary Large Object”, or "BLOB".) 


It is up to you to be sure that the writer and the reader of the Data Buffer are interpreting 
the bytes in the same way! 


Vulkan calls these things “Buffers”. But, Vulkan calls other things “Buffers”, too, such as 
Texture Buffers and Command Buffers. So, | sometimes have taken to calling these things 
“Data Buffers” and have even gone to far as to override some of Vulkan’s own terminology: 
typedef VkBuffer VkDataBuffer; 


This is probably a bad idea in the long run. 
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Creating and Filling Vulkan Data Buffers d 


bufferUsage 
queueFamilyIndices 
size (bytes) 


LogicalDevice 


VkBufferCreatelnfo 


vkCreateBuffer( ) 


/ 


Buffer 


vkGetBufferMemoryRequirements( ) 


memoryType 


VkMemoryAllocatelnfo 


LogicalDevice vkAllocateMemory( ) 


bufferMeMoryHandle 


vkBindBufferMemory( ) 


vkMapMemory( ) 


gpuAddress 


=. m 
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Creating a Vulkan Data Buffer 


VkBuffer Buffer; 


VkBufferCreatelnfa 
vbci.sType = VK ST 
vbci.pNext = nullptr; 
vbci.flags = 0; 
vbci.size = << buffer sizà in bytes >> 
vbci.usage = <<or’ed bits\of: 

VK_USAGE_TRANSKER_SRC_BIT 
VK_USAGE_TRANSFER_DST_BIT 
VK_USAGE_UNIFORM_TEXEL_BUFFER_BIT 
VK_USAGE_STORAGE\ TEXEL_BUFFER_BIT 
VK_USAGE_UNIFORM_BUFFER_BIT 
VK_USAGE_STORAGE_RUFFER_BIT 
VK_USAGE_INDEX_BUFFER_BIT 
VK USAGE VERTEX_BUFFER_BIT 
VK USAGE INDIRECT ВОҚҒЕК ВІТ 
vbci.sharingMode = << one of: >> 
VK SHARING MODE EXCLUSIVE 
VK SHARING MODE CONCURRENT 
vbci.queueFamilyIndexCount = 0; 
vbci.pQueueFamilyIndices = (const iont32_t) nullptr; 


result = vkCreateBuffer ( LogicalDevice, IN &vbci, PALLOCATOR, OUT &Buffer ); 


UCTURE TYPE BUFFER CREATE INFO; 


78 
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Allocating Memory for a Vulkan Data Buffer, Binding a 79 
Buffer to Memory, and Writing to the Buffer 


VkMemoryRequirements 


result = vkGetBufferMemoryR SV ); 


VkMemoryAllocatelnfo 
vmai.sType = VK_STRWCTUR 
vmai.pNext = nullptr; 
vmai.flags = 0; 
vmai.allocationSize = vmr.size; 
vmai.memoryTypelndex = FindMemoryThatlsHostVisible( ); 


(YPE MEMORY. ALLOCATE INFO; 


VkDeviceMemory 
result = vkAllocateMemory( Log N &vmai, PALLOCATOR, OUT &vdm ); 
result = vkBindBufferMemory( Log Il 0 is the offset 


result = v«MapMemory( LogicalDevice, IN уат, 0, VK_WHOLE_ SIZE, 0, &ptr ); 


<< do the memory copy >> 
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АШ result = vkUnmapMemory( LogicalDevice, IN мат ); 


Finding the Right Type of Memory 80 


int 

FindMemoryThatlsHostVisible( ) 

{ 
VkPhysicalDeviceMemoryProperties vpdmp; 
vkGetPhysicalDeviceMemoryProperties( PhysicalDevice, OUT &vpdmp ); 
for( unsigned int i = 0; i < vpdmp.memoryTypeCount; i++ ) 


emon 


dmp.m pes 
MEMORY_ PROPERTY_ HOST VISIBLE BID != 0) 


VkMemoryType vmt = vp 
if( ( vmt.propertyFlags & 
{ 


O 


return i; 


} 


return -1; 


THINK 
© SIGGRAPH I 


mjb — July 24, 2020 


Finding the Right Type of Memory 81 


int 

FindMemoryThatlsDeviceLocal( ) 

{ 
VkPhysicalDeviceMemoryProperties vpdmp; 
vkGetPhysicalDeviceMemoryProperties( PhysicalDevice, OUT &vpdmp ); 
for( unsigned int i = 0; i < vpdmp.memoryTypeCount; i++ ) 


VkMemoryType vmt = vpdmp,memoryTypes}+ 
if( ( vmt.propertyFlags@ VK MEMORY _ PROPERTY _ DEVICE LOCAL БІТ >-0) 
{ 


return i; 


} 


return -1; 


2 THINK 
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Finding the Right Type of Memory 


H THINK 


VkPhysicalDeviceMemoryProperties vpdmp; 
vkGetPhysicalDeviceMemoryProperties( PhysicalDevice, OUT &vpdmp ); 


11 Memory Types: 
Memory 
Memory 
Memory 
Memory 
Memory 
Memory 
Memory 
Memory 7: DeviceLocal 

Memory 8: DeviceLocal 

Memory 9: HostVisible HostCoherent 

Memory 10: HostVisible HostCoherent HostCached 


posa те ае, 


2 Memory Heaps: 
Heap 0: size = 0xb7c00000 DeviceLocal 
Heap 1: size = 0xfac00000 


© SIGGRAP 
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Sidebar: The Vulkan Memory Allocator (VMA) 83 


The Vulkan Memory Allocator is a set of functions to simplify your view of 
allocating buffer memory. | don't have experience using it (yet), so I’m not in a 
position to confidently comment on it. But, | am including its github link here 
and a little sample code in case you want to take a peek. 


https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator 


This repository includes a smattering of documentation. 
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Sidebar: The Vulkan Memory Allocator (VMA) 84 


#define VMA_IMPLEMENTATION 
#include “vk_mem_alloc.h” 


VkBufferCreatelnfo vbci; 


VmaAllocationCreatelnfo vaci; 
vaci.physicalDevice = PhysicalDevice; 
vaci.device = LogicalDevice; 
vaci.usage = VMA_MEMORY_USAGE_GPU_ONLY; 


VmaAllocator var; 
vmaCreateAllocator( IN &vaci, OUT &var ); 


VkBuffer Buffer; 
VmaAllocation van; 
vmaCreateBuffer( IN var, IN &vbci, IN &vaci, OUT &Buffer. OUT &van, nullptr ); 


void *mappedDataAddr; 
vmaMapMemory( IN var, IN van, OUT &mappedDataAddr ); 

тетсру( mappedDataAddr, &MyData, sizeof(MyData) ); 
vmaUnmapMemory( IN var, IN van ); 


THINK 
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Something I’ve Found Useful 85 


| find it handy to encapsulate buffer information in a struct: 


typedef struct MyBuffer 

{ 
VkDataBuffer buffer; 
VkDeviceMemory vdm; 
VkDeviceSize size; 

} MyBuffer; 

MyBuffer MyMatrixUniformBuffer; 


Its the usual object-oriented benefit — you can pass around just one 
data-item and everyone can access whatever information they need. 


It also makes it impossible to accidentally associate the wrong 


VkDeviceMemory and/or VkDeviceSize with the wrong data buffer. 


я THINK 
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Initializing a Data Buffer 86 


It's the usual object-oriented benefit — you can pass around just one 
data-item and everyone can access whatever information they need. 


VkResult 
Init05DataBuffer( VkDeviceSize size, VkBufferUsageFlags usage, OUT MyBuffer * pMyBuffer ) 
{ 


vbci.size = pMyBuffer->size = size; 
result = vkCreateBuffer ( LogicalDevice, IN &vbci, PALLOCATOR, OUT &pMyBuffer->buffer ); 


pMyBuffer->vdm = vdm; 


72 SIGGRAPH THINK 
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Here's а C struct used by the Sample Code to hold some uniform variables 87 


struct matBuf 


{ 


glm::mat4 uModelMatrix; 
glm::mat4 uViewMatrix; 
glm::mat4 uProjectionMatrix; 
glm::mat3 uNormalMatrix; 

} Matrices; 


Here’s the associated GLSL shader code to access those uniform variables 


layout( std140, set = 0, binding = 0 ) uniform matBuf 
{ 


mat4 uModelMatrix; 


mat4 uViewMatrix; 
mat4 uProjectionMatrix; 
mat4 uNormalMatrix; 

} Matrices; 


e THINK 
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Filling those Uniform Variables 


uint32 t Height, Width; 
const double FOV = glm::radians(60.); // field-of-view angle in radians 


glm::vec3 eye(0.,0.,EYEDIST); 

glm::vec3 look(0.,0.,0.); 

glm::vec3 up(0.,1.,0.); 

Matrices.uModelMatrix = glm::mat4( 1. ); I] identity 


Matrices.uViewMatrix = glm::lookAt( eye, look, up ); 


Matrices.uProjectionMatrix = glm::perspective( FOV, (double)Width/(double)Height, 0.1, 1000. ); 
Matrices.uProjectionMatrix[1][1] *= -1.; Il account for Vulkan’s LH screen coordinate system 


Matrices.uNormalMatrix = glm::inverseTranspose( glm::mat3( Matrices.uModelMatrix ) ); 


© SIGGRAP 


This code assumes that this line: 


#define GLM_FORCE_RADIANS 


is listed before GLM is included! 


THINK 
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The Parade of Buffer Data 89 


MyBuffer = MyMatrixUniformBuffer; 


The MyBuffer does not hold any actual data itself. It just 


information about what is in the data buffer 


VkResult 
Init05DataBuffer( VkDeviceSize size, VkBufferUsageFlags usage, OUT MyBuffer * pMyBuffer ) 
{ 


vbci.size = pMyBuffer->size = size; 


result = vkCreateBuffer ( LogicalDevice, ІМ &vbci, PALLOCATOR, OUT &pMyBuffer->buffer ); 


pMyBuffer->vdm = vdm; 


This C struct is holding the 
original data, written by the 


application. — Memory. 
орет сору Тһе Data Buffer in GPU memory is holding the 
struct matBuf Matrices; copied data. It is readable by the shaders 


glm::vec3 eye(0.,0.,E YEDIST); 
glm::vec3 look(0.,0.,0.); 


glm::vec3 up(0.,1.,0.); uniform matBuf Matrices; 


Matrices.uModelMatrix = glm::mat4( ); // identity layout( std140, set = 0, binding = 0 ) uniform matBuf 


Matrices.uViewMatrix = glm::lookAt( eye, look, up ); { mat4 uModelMatrix: 


mat4 uViewMatrix; 
mat4 uProjectionMatrix; 
mat4 uNormalMatrix; 

} Matrices; 


Matrices .uProjectionMatrix = glm::perspective( FOV, (double)VVidth/(double)Height, 0.1, 1000. ); 
Matrices.uProjectionMatrix[1][1] *= -1.; 


г> € Matrices.uNormalMatrix = glm::inverseTranspose( glm::mat3( Matrices.uModelMatrix ) ); 
VIUUNHI Il пут 
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Filling the Data Buffer 


typedef struct MyBuffer 
{ 


VkDataBuffer buffer; 
VkDeviceMemory vdm; 
VkDeviceSize size; 

y MyBuffer; 


MyBuffer MyMatrixUniformBuffer; 


InitóbUniformBuffer( sizeof(Matrices), OUT &MyMatrixUniformBuffer ) 


FillobDataBuffer( MyMatrixUniformBuffer, ІМ (void *) &Matrices y 


gim::vec3 eye(0.,0.,EYEDIST); 
gim::vec3 look(0.,0.,0.); 
glm::vec3 up(0.,1.,0.); 


= glm::mat4( ); // identity 


Matrices.uViewMatrix = glm::lookAt( eye, look, ир); 


Matrices.uProjectionMatrix = glm::perspective( FOV, (double)Width/(double)Height, 0.1, 1000. ); 
Matrices.uProjectionMatrix[1][1] *= -1.; 


Matrices. uNormalMatrix = glm::inverseTranspose( glm::mat3( Matrices. uModelMatrix ) ); 
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Creating and Filling the Data Buffer — the Details 91 


VkResult 
Init05DataBuffer( VkDeviceSize size, VkBufferUsageFlags usage, OUT MyBuffer * pMyBuffer ) 


VkResult result = VK CCESS; 

VkBufferCreatelnf 
vbci.sType = VK 
vbci.pNext = nullptr; 
vbci.flags = 0; 
vbci.size = pMyBuffer->size * size; 
vbci.usage = usage; 
vbci.sharingMode = VK_SHARIN 
vbci.queueFamilyIndexCount = 0; 
vbci.pQueueFamilyIndices = (const uir&32 t *)nullptr; 

result = vkCreateBuffer ( LogicalDevice, IN &vbci, PALLOCATOR, OUT &pMyBuffer->buffer ); 


RÒCTURE_TYPE_BUFFER_CREATE_INFO; 


ODE_EXCLUSIVE; 


VkMemoryRequirements 


vkGetBufferMemoryRequirements(/LogicalDevice, IN pMyBuffer->buffer, OUT &vmr ); Il fills vmr 


VkMemoryAllocatelnfo 
vmai.sType = УК STRUC 
vmai.pNext - nullptr; 
vmai.allocationSize = vmr.size; 
vmai.memoryTypelndex = FindMemor 


ca PYPE_MEMORY_ALLOCATE_INFO; 


hatlsHostVisible( ); 


VkDeviceMemory 
result = vkAllocateMemor 
pMyBuffer->vdm = vdm; 


result = vkBindBufferMemory( LogicalDevice, pMyBuffer->buffer, IN vdm, OFFSET ZERO ); 


return result; 
} mjb — July 24, 2020 
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VkResult 
FillóbDataBuffer( IN MyBuffer myBuffer, IN 


{ 


Creating and Filling the Data Buffer — the Details 92 


// the size of the data had better match/fhe size that was used to Init the buffer! 


void * pGpuMemory; 
vkMapMemory( LogicalDeviceAN myBuffer.vdm, 0, УК WHOLE SIZ r&pGpuMemory J; 
О and 0 are offset and flags 


memcpy( pGpuMemory, data, (size_t)myBuffer.size ); 
vkUnmapMemory( LogicalDevice, IN myBuffer.vdm ); 
return VK_SUCCESS; 


Remember — to Vulkan and GPU memory, these are just bits. 
It is up to you to handle their meaning correctly. 


H THINK 
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Vulkan. 


GLFW 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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Setting Up GLFW 94 


#define GLFW_INCLUDE_VULKAN 
#include "glfw3.h" 


uint32_t Width, Height; 
VkSurfaceKHR Surface; 
void 
InitGLFW( ) 
{ 
glfwinit( ); 


if( ! glfwVulkanSupported( ) ) 


fprintf( stderr, "Vulkan is not supported on this system!" ); 
exit( 1 ); 


} 

glfwWindowHint( GLFW CLIENT АРІ, GLFW NO АРІ); 

glfwWindowHint( GLFW RESIZABLE, GLFW FALSE ); 

MainWindow = glfwCreateWindow( Width, Height, "Vulkan Sample", NULL, NULL ); 
VkResult result = glfwCreateWindowSurface( Instance, MainWindow, NULL, OUT &Surface ); 


glfwSetErrorCallback( GLFWErrorCallback ); 

glfwSetKeyCallback( MainWindow, GLFWKeyboard ); 
glfwSetCursorPosCallback( MainWindow, GLFWMouseMotion ); 
glfwSetMouseButtonCallback( MainWindow, GLFWMouseButton ); 
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You Can Also Query What Vulkan Extensions GLFW Requires 95 


uint32 t count; 
const char ** extensions = glfwGetRequiredlnstanceExtensions (&count); 


fprintf( FoDebug, ^nFound %d GLFW Required Instance Extensions:\n", count ); 
for( uint32 t i= 0; i < count; i++ ) 


fprintf( FpDebug, "\t%s\n", extensions[ i |); 


Found 2 GLFW Required Instance 
Extensions: 


VK KHR surface 
VK KHR win32 surface 


я THINK 
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GLFW Keyboard Callback 96 


void 
GLFWKeyboard( GLFWwindow * window, int key, int scancode, int action, int mods ) 


{ 


if( action == GLFW PRESS) 


switch( key ) 
{ 
/ІСаве GLFW_KEY_M: 
case 'm': 
case "М" 
Mode++; 
if( Mode >= 2 ) 
Mode = 0; 
break; 


default: 


рип FoDebug, "Unknow key hit: 0x%04x = '9oc^n", key, key ); 
fflush(FpDebug); 


: BEYOND 
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GLFW Mouse Button Callback 97 


void 
GLFWMouseButton( GLFWwindow *window, int button, int action, int mods ) 


{ 
int = 0; II LEFT, MIDDLE, or RIGHT 


// get the proper button bit mask: 
switch( button ) 
( 
case GLFW MOUSE BUTTON LEFT: 
b-LEFT; break; 


case GLFW MOUSE BUTTON MIDDLE: 
b = MIDDLE; break; 


case GLFW_MOUSE_BUTTON_RIGHT: 
b = RIGHT; break; 


default: 
b=0; 
fprintf( FpDebug, "Unknown mouse button: %d\n", button ); 
} 


Il button down sets the bit, up clears the bit: 
if( action == GLFW_PRESS ) 
{ 


double xpos, ypos; 

glfwGetCursorPos( window, &xpos, &ypos); 

Xmouse = (int)xpos; 

Ymouse = (int)ypos; 

ActiveButton |= b; Il set the proper bit 
} 


else 


{ 
} 


ActiveButton &= ~b; Il clear the proper bit 


= THINK 
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GLFW Mouse Motion Callback 98 


void 

GLFWMouseMotion( GLFWwindow “window, double xpos, double уро ) 

{ 
int dx = (int)xpos - Xmouse; Il change in mouse coords 
int dy = (int)ypos - Ymouse; 


if( ( ActiveButton & LEFT ) !=0 ) 


{ 
Xrot += ( ANGFACT*dy ); 


Yrot += ( ANGFACT*dx ); 
} 


if( ( ActiveButton & MIDDLE ) != 0) 


{ 
Scale += SCLFACT * (float) ( dx - dy ); 


Il keep object from turning inside-out or disappearing: 


if( Scale < MINSCALE ) 
Scale = MINSCALE; 


} 


Xmouse = (int)xpos; Il new current position 
Ymouse = (int)ypos; 


THINK 
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Looping and Closing GLFW 99 


Does not block — 
processes any waiting events, then returns 


Il elapsed time, in double-precision seconds 


while( glfwWindowShouldClose( MainWindow ) == 


glfwPollEvents( ); 
Time = glfwGetTime( ); 
UpdateScene( ); 
RenderScene( ); 


} 


vkQueueWaitldle( Queue ); 
vkDeviceWaitldle( LogicalDevice ); 
DestroyAllVulkan( ); 
glfwDestroyWindow( MainWindow ); 
glfwTerminate( ); 


THINK 
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Looping and Closing GLFW 100 
If you would like to block waiting for events, use: 
glfwWaitEvents( ); 


You can have the blocking wake up after a timeout period with: 


glfwWaitEventsTimeout( double secs ); 


You can wake up one of these blocks from another thread with: 


glfwPostEmptyEvent( ); 


С) 
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What is GLM? 102 


GLM is a set of C++ classes and functions to fill in the programming gaps in writing 
the basic vector and matrix mathematics for OpenGL applications. However, even 
though it was written for OpenGL, it works fine with Vulkan. 


Even though GLM looks like a library, it actually isn’t — it is all 
specified in *.hpp header files so that it gets compiled in with your 


source code. . 
You can find it at: 


http://glm.g-truc.net/0.9.8.5/ 


OpenGL treats all angles as given in degrees. 
This line forces GLM to treat all angles as given 
iN radians. | recommend this so that all angles 
you create in a// programming will be in radians. 


You invoke GLM like this: 


define СІМ FORCE RADIAN 


#include «glm/glm.hpp»? 
#include «glm/gtc/matrix transform.hpp» 
#include «glm/gtc/matrix inverse.hpp» 


If GLM is not installed in a system place, put it somewhere you can get access to. 


H Tun 
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Why are we even talking about this? 


All of the things that we have talked about being deprecated in OpenGL are really 


deprecated in Vulkan -- built-in pipeline transformations, begin-end, fixed-function, etc. So, 
where you might have said in OpenGL: 


glMatrixMode( GL_MODELVIEW ); 

glLoadldentity( ); 

gluLookAt( 0., 0., 3., 0., 0., 0., 0., 1., 0. ); 

glRotatef( (GLfloat)Yrot, 0., 1., 0. ); 

glRotatef( (GLfloat)Xrot, 1., 0., 0. ); 

glScalef( (GLfloat)Scale, (GLfloat)Scale, (GLfloat)Scale ); 


you would now Say: 


glm::mat4 modelview = glm::mat4( 1. ); // identity 

glm::vec3 еуе(0.,0.,3.); 

glm::vec3 look(0.,0.,0.); 

glm::vec3 ир(0.,1.,0.); 

modelview = glm::lookAt( eye, look, up ); II {x’, y’,z’} = МҰх,у,2) 
modelview = glm::rotate( modelview, D2R*Yrot, glm::vec3(0.,1.,0.) ); // {х,у} = [v]*[yr]*{x, y,Z} 
modelview = glm::rotate( modelview, D2R*Xrot, glm::vec3(1.,0.,0.) ); // {x,y,z} = [v]*[yr]*[xr]*{x, y,z} 
modelview = glm::scale( modelview, glm::vec3(Scale,Scale,Scale) ); // {x,y,z} = [v]*[yr]*[xr]*[s]*{x, y,z} 


This is exactly the same concept as OpenGL, but a different expression of it. Read on for details ... 


THINK 
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The Most Useful GLM Variables, Operations, and Functions 104 


II constructor: 
glm::mat4( 1. ); Il identity matrix 


glm::vec4( ); 
glm::vec3( ); 


СІМ recommends that you use the *glm::" syntax and avoid “using namespace" 
syntax because they have not made any effort to create unique function names 

II multiplications: 

gim::mat4 * glm::mat4 


glm::mat4 * glm::vec4 
gim::mat4 * glm::vec4( glm::vec3, 1. ) Il promote а vec3 to a vec4 via a constructor 


II emulating OpenGL transformations with concatenation: 
glm::mat4 glm::rotate( glm::mat4 const & m, float angle, glm::vec3 const & axis ); 
glm::mat4 glm::scale( glm::mat4 const & т, glm::vec3 const & factors ); 


£2 SIGGRAPH ШІК glm::mat4 glm::translate( glm::mat4 const & т, glm::vec3 const & translation ); 
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The Most Useful GLM Variables, Operations, and Functions 105 


II viewing volume (assign, not concatenate): 


glm::mat4 glm::ortho( float left, float right, float bottom, float top, float near, float far ); 
glm::mat4 glm::ortho( float left, float right, float bottom, float top ); 


glm::mat4 glm::frustum( float left, float right, float bottom, float top, float near, float far ); 
glm::mat4 glm::perspective( float fovy, float aspect, float near, float far); 
II viewing (assign, not concatenate): 


gim::mat4 glm::lookAt( glm::vec3 const & eye, glm::vec3 const & look, glm::vec3 const & ир ); 


> THINK 
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Installing GLM into your own space 106 


| like to just put the whole thing under my Visual Studio project folder 
so | can zip up a complete project and give it to someone else. 


File Edit View Tools Help 
Organize v 24 Open ~ New folder 


> THN 


x Favorites 
№ Downloads 
$2 Dropbox 
Ë Engr Classes 
ЕЕ Engr Desktop 
(È) Engr Documents 
Ë Engr Home Director 
| Recent Places 


ігі Libraries 
В Documents 


а) Music 


ы Pictures 


Е Videos 


ІШ Computer 
& Local Disk (C:) 
68 түр (\guille\bailey\ 
GP apps (\\stak) (V:) 
G4 classes (\\farm) (W:) 
G2 mjb (\\stak.engr.ore 


Gu Network 


| Loe “ Sample.sin 


in] freeglut.h 
а freeglut.lib 
| freeglut_ext.h 
[t] freeglut std.h 
ih) glew.h 

72 glew32.lib 

|^] glut.h 

А libcd.lib 

tà LIBCID.LIB 
ej sample.cpp 
= Sample.dsp 
95 Sample.dsw 
а Sample.ncb 
|] Sample.opt 
1.1 Sample.plg 


> 


Date modified 


11/4/2015 3:52 PM 
11/4/2015 3:42 PM 
11/4/2015 3:40 PM 
4/2015 3:34 PM 
/2015 3:19 PM 
/2015 3:34 PM 
22/2015 8:45 AM 
/2015 3:27 PM 
/2015 3:27 PM 
/2015 3:34 PM 
10/9/2007 8:41 AM 
6/17/1998 1:00 AM 
11/4/2015 3:51 PM 
1/7/2002 11:31 AM 
1/7/2002 11:24 AM 
3/12/2010 11:17 AM 
12/30/2002 10:14 ... 
12/30/2002 10:12 ... 


Type 


File folder 

File folder 

File folder 

C/C++ Header 
Object File Library 
C/C++ Header 
C/C++ Header 
C/C++ Header 
Object File Library 
C/C++ Header 
Object File Library 
Object File Library 
C++ Source 
VC++ 6 Project 
VC++ 6 Workspace 


VC++ Intellisense... 


OPT File 
PLG File 


1 КВ 

39 KB 
11 KB 
27 KB 
977 KB 
583 KB 
1KB 
2,953 KB 
389 KB 
22 KB 

5 KB 
1KB 
14,043 KB 
48 KB 

2 KB 


118 Sample.sin 


1/5/2011 8:49 AM 


Microsoft Visual S... 


1KB| 


1 Sample.suo 


++ Sample.vcproj 


2 Sample.vcproj.EECS.mjb.user 


Sample.vcxproi 


Date modified: 1/5/2011 8:49 AM 


Microsoft Visual Studio Solution 


Size: 878 bytes 


22/2015 4:47 PM 
8/31/2009 12:42 PM 
2010 11:17 AM 

/4/2015 3:52 PM 


Date created: 11/4/2015 3:40 PM 


Visual Studio Solu... 


VC++ Project 
Visual Studio Proj... 


VC++ Project 


Offline availability: Not available 


17 KB 
6 KB 
2KB 
8 KB 
Offline status: Online 
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Here’s what that GLM folder looks like 107 


a š 
БЭН › Computer » тір (\\guille\bailey\users) (К) » CS550 » XA) 


File Edit View Tools Help 


Organize т Burn New folder 


Ft Favorites 
B Downloads 
$2 Dropbox 
y Engr Classes 
ШЕ Engr Desktop 
(È) Engr Documents 
Е Engr Home Director 


ІІ Recent Places 


Libraries 


Name 


J: detail 

J ste 

Шақ 

|) CMakeLists.tet 
[n] common.hpp 


[n] exponential.hpp 
in) et.hpp 

[t] fwd.hpp 

[n] geometric.hpp 


Date modified 


11/4/2015 3:42 PM 
11/4/2015 3:42 PM 
11/4/2015 3:42 PM 
11/4/2015 3:41 PM 
11/4/2015 3:41 PM 
11/4/2015 3:41 PM 
11/4/2015 3:41 PM 
11/4/2015 3:41 PM 
11/4/2015 3:41 PM 


Type 

File folder 

File folder 

File folder 

Text Document 
C/C++ Header 
C/C++ Header 
C/C++ Header 
C/C++ Header 
C/C++ Header 


Documents in] діт.һрр 11/4/2015 3:41 РМ С/С++ Header 
ІҢ integer.hpp 11/4/2015 3:41PM C/C++ Header 
[h] mat2x2.hpp 11/4/2015 3:41 PM С/С++ Header 
[n] mat2x3.hpp 11/4/2015 3:41 PM C/C++ Header 
im) mat2x4.hpp 11/4/2015 3:41 PM C/C++ Header 
ІШ Computer [h] mat3x2.hpp Type: C/C++ Header 
2 Size: 3.04 KB 
& Local Disk (C:) [n] mat3x3.hpp 
G mjb (\\guille\bailey) (п) mat3x4.hpp 7 i j 
GI apps (\\stak) (V:) [n] mat4x2.hpp 11/4/2015 3:41РМ C/C++ Header 
© classes (\\farm) (М) [n] mat4x3.hpp 11/4/2015 3:41РМ C/C++ Header 
99 mjb (\\stak.engr.ore [n] mat4x4.hpp 11/4/2015 3:41PM C/C++ Header 
[n] matrix.hpp 11/4/2015 3:41PM С/С++ Header 
@ Network P] packing.hpp 11/4/2015 3:41 PM С/С++ Header 
[n] trigonometric.hpp 11/4/2015 3:41 РМ С/С++ Header 
in] vec2.hpp 11/4/2015 3:41PM С/С++ Header 
[n] vec3.hpp 11/4/2015 3:41PM C/C++ Header 
[n] vec4.hpp 11/4/2015 3:41 PM С/С++ Header 
[h] vector relational.hpp 11/4/2015 3:41 РМ С/С++ Header 


27 items Offline status: Online 
Offline availability: Not available 
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GLM in the Vulkan sample.cpp Program 108 


if( UseMouse ) 
{ 
if( Scale < MINSCALE ) 
Scale = MINSCALE; 
Matrices.uModelMatrix = glm::mat4( 1. ); Il identity 
Matrices.uModelMatrix = glm::rotate( Matrices.uModelMatrix, Yrot, glm::vec3( 0.,1.,0.) ); 
Matrices.uModelMatrix = glm::rotate( Matrices.uModelMatrix, Xrot, glm::vec3( 1.,0.,0.) ); 
Matrices.uModelMatrix = glm::scale( Matrices.uModelMatrix, glm::vec3(Scale,Scale,Scale) ); 
Il done this way, the Scale is applied first, then the Xrot, then the Yrot 
} 


else 


if(! Paused ) 
{ 
const glm::vec3 axis = glm::vec3( 0., 1., 0. ); 
Matrices.uModelMatrix = glm::rotate( glm::mat4( 1. ), (float)glm::radians( 360.f*Time/SECONDS_PER_CYCLE ), axis ); 


} 


glm::vec3 eye(0.,0.,EYEDIST ); 

glm::vec3 look(0.,0.,0.); 

glm::vec3 up(0.,1.,0.); 

Matrices.uVewMatrix = glm::lookAt( eye, look, up ); 


Matrices.uProjectionMatrix = glm::perspective( FOV, (double)Width/(double)Height, 0.1f, 1000.f ); 
Matrices.uProjectionMatrix[1][1] *= -1.; II Vulkan’s projected Y is inverted from OpenGL 


Matrices.uNormalMatrix = glm::inverseTranspose( glm::mat3( Matrices.uModelMatrix ); // note: inverseTransform ! 
FillobDataBuffer( MyMatrixUniformBuffer, (void *) &Matrices ); 
Misc.uTime - (float)Time; 


Misc.uMode = Mode; 
Fill05DataBuffer( MyMiscUniformBuffer, (void *) &Misc ); 


62 THINK 
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Sidebar: Why Isn’t The Normal Matrix exactly the same as the Model Matrix? 


It is, if the Model Matrix is all rotations and uniform 
scalings, but if it has non-uniform scalings, then it is not. 
These diagrams show you why. 


Wrong! 


„7 


glm::mat3 NormalMatrix = glm::mat3(Model); 


AM Right! 
Original object and normal 


glm::mat3 NormalMatrix = glm::inverseTranspose( glm::mat3(Model) ); 
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Instancing — What and why? 111 


“ Instancing is the ability to draw the same object multiple times 
° |t uses all the same vertices and graphics pipeline each time 


e It avoids the overhead of the program asking to have the object drawn again, 
letting the GPU/driver handle all of that 


vkCmdDraw( CommandBuffers[nextlmagelndex], vertexCoun CinstanceCount, p stVertex, firstInstance ); 


But, this will only get us multiple instances of identical objects drawn on top of 
each other. How can we make each instance look differently? 


BTW, when not using instancing, be sure the instanceCount is 1, not 0! 


2 THINK 
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Making each Instance look differently -- Approach #1 112 


Use the built-in vertex shader variable gl_Instancelndex to define a unique display property, 
such as position or color. 


gl Instancelndex starts at 0 


In the vertex shader: 


out vec3 vColor; 
const int NUMINSTANCES = 16; 


const float DELTA = 3.0; 


float xdelta = DELTA * float( gl_Instancelndex % 4); 


float ydelta = DELTA * float( gl_Instancelndex / 4 ); 
vColor = vec3( 1., float( (1.*gl Instancelndex) ) / float( NUMINSTANCES ), 0. ); 


xdelta -= DELTA * sqrt( float(:NUMINSTANCES) ) / 2.; 
ydelta -= DELTA * sqrt( float(:NUMINSTANCES) ) / 2.; 
vec4 vertex = vec4( aVertex.xyz + vec3( xdelta, ydelta, O. ), 1. ); 


gl Position = PVM * vertex; Il [pl [М] [тп] 
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Making each Instance look differently -- Approach #2 114 


Put the unique characteristics in a uniform buffer array and reference them 
Still uses gl_Instancelndex 


In the vertex shader: 


layout( std140, set = 3, binding = 0 ) uniform colorBuf 


{ 
vec3 uColors[1024]; 
} Colors; 


out vec3 vColor; 


int index = gl Instancelndex % 1024; // or “& 1023" — gives 0 - 1023 
vColor = Colors.uColors[ index |; 


vec4 vertex =... 


gl_ Position = PVM * vertex; I! [p]*[v]*[m] 
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What is the Vulkan Graphics Pipeline? 


Don't worry if this is too small to read — a 


larger version is coming up. 


2. The Vulkan Graphics Pipeline is not the processes that OpenGL 
would call "the graphics pipeline". 


3. For the most part, the Vulkan Graphics Pipeline Data Structure is immutable — that is, once this combination of 
state variables is combined into a Pipeline, that Pipeline never gets changed. To make new combinations of state 
variables, create a new Graphics Pipeline. 


The shaders get compiled the rest of the way when their Graphics Pipeline gets created. 
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There is also a Vulkan Compute Pipeline Data Structure. 
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Graphics Pipeline Stages and what goes into Them 117 


The GPU and Driver specify the Pipeline Stages — the 
Vulkan Graphics Pipeline declares what goes in them 


Vertex Shader module 
Specialization info 
Vertex Input binding 
Vertex Input attributes 


| F, —Q O Q 


Topology — U 


Tessellation Shaders, Geometry Shader —n- 


Viewport 
Scissoring 


Depth Clamping 
DiscardEnable 
PolygonMode 
CullMode 
FrontFace 
LineWidth 


Which states are dynamic 


DepthTestEnable 

DepthWriteEnable 
DepthCompareOp 
StencilTestEnable 


Fragment Shader module 
Specialization info 


P uv 
c2 SIGGRAPH l Color Blending parameters 
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The First Step: Create the Graphics Pipeline Layout 118 


The Graphics Pipeline Layout is fairly static. Only the layout of the 
Descriptor Sets and information on the Push Constants need to be supplied. 


VkResult 
Init14GraphicsPipelineLayout( ) 


VkResult result; 


VkPipelineLayoutCreatelnfo әрісі) 
үрісіТуре = УК STRUCTURE. TYREAPIÉELINE LAYOUT CREATE INFO; 


Let the Pipeline Layout know about the 
Descriptor Set and Push Constant layouts. 


result - vkCreatePipelineLayout( LogicalDevice, IN &vplci, PALLOCATOR, OUT &GraphicsPipelineLayout ); 


1 als = Ba 


vplci. pushConstan angeCount = 0; 


vplci.pPushConstantRanges = (VkPushConstantRange *)nullptr; 


return result; 


Why is this necessary? It is because the Descriptor Sets and Push 
Constants data structures have different sizes depending on how many of 
each you have. So, the exact structure of the Pipeline Layout depends on 
€2 SIGGRAPH THINK you telling Vulkan about the Descriptor Sets and Push Constants that you 
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A Pipeline Data Structure Contains the Following State Items: ШЕ; 


* Pipeline Layout: Descriptor Sets, Push Constants 

• Which Shaders to use 

* Per-vertex input attributes: location, binding, format, offset 

* Per-vertex input bindings: binding, stride, inputRate 

* Assembly: topology 

• Viewport: x, у, w, h, minDepth, maxDepth 

* Scissoring: x, y, w, h 

* Rasterization: cullMode, polygonMode, frontFace, lineWidth 

* Depth: depthTestEnable, depthWriteEnable, depthCompareOp 

* Stencil: stencilTestEnable, stencilOpStateFront, stencilOpStateBack 

* Blending: blendEnable, srcColorBlendFactor, dstColorBlendFactor, colorBlendOp, 
srcAlphaBlendFactor, dstAlphaBlendFactor, alpghaBlendOp, colorWriteMask 

* DynamicState: which states can be set dynamically (bound to the command buffer, outside the Pipeline) 


Bold/Italics indicates that this state item can also be set with Dynamic State Variables 
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Creating a Graphics Pipeline from a lot of Pieces 


which stage (VERTEX, etc.) 


VkSpecializationInfo 
vkCreatePipelineLayout( ) 


VkPipelineShaderStageCreatelnfo 


VkPipelineVertexInputStateCreatelnfo Topology 
Shaders VkPipelinelnputAssemblyStateCreatelnfo 
VertexInput State 


binding 


VkPipelineLayoutCreatelnfo stride 


inputRate 


location 
binding 


VkShaderModule 


format 
offset 


VkVertexInputBindingDescription 


VkVertexInputAttributeDescription 


x, y, w, h, 

InputAssembly State Й 
Tesselation State VkViewportStateCreatelnfo Viewport minDepth, 
maxDepth 


Viewport State 
Rasterization State 
MultiSample State 


Scissor 


VkPipelineRasterizationStateCreatelnfo 


DepthStencil State 

2. State VkPipelineDepthStencilStateCreatelnfo ое 
Dynamic State polygonMode 
Pipeline layout frontFace 
RenderPass lineWidth 


basePipelineHandle 
basePipelinelndex 


depthTestEnable 

ШЕ VkPipelineColorBlendStateCreatelnfo depthWriteEnable 
depthCompareOp 

VkGraphicsPipelineCreatelnfo stencilTestEnable 


stencilOpStateFront 


stencilOpStateBack 


blendEnable 
srcColorBlendFactor 


VkPipelineColorBlendAttachmentState 
dstColorBlendFactor 
colorBlendOp 


vkCreateGraphicsPipeline( ) 
srcAlphaBlendFactor 


VkPipelineDynamicStateCreatelnfo dstAlphaBlendFactor 
alphaBlendOp 


THINK C| Graphics Pipeline p АА 
5.2 SIGGRAPH BEYOND Array naming the states that can be set dynamically ees 
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Creating a Typical Graphics Pipeline 121 


VkResult 
Init14GraphicsVertexFragmentPipeline( VkShaderModule vertexShader, VkShaderModule fragmentShader, 
VkPrimitive Topology topology, OUT VkPipeline *pGraphicsPipeline ) 


{ 
#ifdef ASSUMPTIONS 
vvibd[0].inputRate = VK VERTEX INPUT RATE VERTEX; 
vprsci.depthClampEnable = VK FALSE; 
vprsci.rasterizerDiscardEnable = VK FALSE; 
vprsci.polygonMode = VK POLYGON MODE FILL; 
vprsci.cullMode = МК CULL MODE NONE; //best to do this because of the projectionMatrix[1][1] *= -1.; 
vprsci.frontFace - VK FRONT FACE COUNTER CLOCKWISE; 
vpmsci.rasterizationSamples = УК SAMPLE COUNT ONE BIT; 
vpcbas.blendEnable = VK FALSE; 
vpcbsci.logicOpEnable = VK FALSE; 
vpdssci.depthTestEnable = VK TRUE; 
vpdssci.depthWriteEnable = VK TRUE; 
vpdssci.depthCompareOp = VK COMPARE OP LESS; 
#endif 


These settings seem pretty typical to me. Let’s write a simplified 
Pipeline-creator that accepts Vertex and Fragment shader modules 


and the topology, and always uses the settings in red above. 


- m 
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The Shaders to Use 122 


VkPipelineShaderStageCreatelnfo С үрззсі2і; > 
vpssci[0].sType = УК STRUCTURE TYPE-PIP E SHADER STAGE CREATE INFO; 

vpssci[0].pNext = nullptr; 

vpssci[0].flags = 0; 

vpssci[0].stage = VK_SHADER_STAGE_VERTEX_BIT; 


vpssci[0].module = vertexShader; 


vpssci[0].pName = "main"; : 
vpssci[0].pSpecializationInfo = (VkSpecializationInfo *)nullptr; Use one VDSSCI alley member per 
#ifdef BITS shader module you are using 


VK_SHADER_STAGE_VERTEX_BIT 
VK_SHADER_STAGE_TESSELLATION_CONTROL_BIT 
VK SHADER STAGE TESSELLATION EVALUATION BIT 
VK SHADER STAGE GEOMETRY BIT 

VK SHADER STAGE FRAGMENT BIT 

VK SHADER STAGE COMPUTE BIT 

VK SHADER STAGE ALL GRAPHICS 

VK SHADER STAGE ALL 


#endif 
vpssci[1].sType = V& STRUCTURE TYPE PIPELINE SHADER STAGE CREATE INFO; 
vpssci[1].pNext = nullptr; 
vpssci[1].flags = 0; 
vpssci[1].stage = УК SHADER STAGE FRAGMENT ВІТ; 
vpssci[1].module = fragmentShader; | 
vpssci[1].pName = "main"; Use one vvibd array member per vertex 
vpssci[1].pSpecializationInfo = (VkSpecializationInfo *)nullptr; input array-of-structures you are using 
VkVertexInputBindingDescription Il an array containing one of these per buffer being used 
vvibd[O].binding = 0; Il which Біп іт 15 
vvibd[O].stride = sizeof( struct vertex ); Il bytes between successive 
vvibd[0].inputRate = V&K VERTEX INPUT RATE VERTEX; 
#ifdef CHOICES 


VK VERTEX INPUT RATE VERTEX 
VK VERTEX INPUT RATE INSTANCE 
#endif 
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VkVertexInputAttributeDescription Ç ума]; > 
ture-eeerd 


114 = vertex, normal, color, tex зе 


// an array containing one of these per vertex attribute in all bindings 


vviad[0].location = 0; Il location in the layout E 

vviad[0].binding = 0; Il which binding description this is part of Use one vviad array member per element 
vviad[0].format = VK FORMAT VEC3; //x,y,z in the struct for the array-of-structures 
vviad[0].offset = offsetof( struct vertex, position ); ГО 


#ifdef EXTRAS_DEFINED_AT_THE_TOP element you are using as vertex input 


Il these are here for convenience and readability: 


#define VK_FORMAT_VEC4 VK_FORMAT_R32G32B32A32_SFLOAT 

#define VK FORMAT. XYZW VK FORMAT R32G32B32A32 SFLOAT 

#define VK FORMAT. VEC3 VK FORMAT R32G32B32 SFLOAT 

#define VK FORMAT. STP VK FORMAT. R32G32B32 SFLOAT These are defined at the top of the 
#define VK FORMAT. XYZ VK FORMAT R32G32B32 SFLOAT ; 

#define VK FORMAT. VEC2 VK FORMAT. R32G32 SFLOAT sample code o that you don t need to 
#define VK FORMAT. ST VK FORMAT. R32G32 SFLOAT use confusing image-looking formats 
#define VK FORMAT. XY VK FORMAT R32G32 SFLOAT iti 

#define VK FORMAT FLOAT VK_FORMAT_R32_SFLOAT for positions, normals, and tex coords 
#define VK FORMAT. S VK FORMAT R32 SFLOAT 

#define VK FORMAT. X VK FORMAT R32 SFLOAT 

#endif 


vviad[1].location = 1; 

vviad[1].binding = 0; 

vviad[1].format = VK_FORMAT_VEC3; Il nx, ny, nz 
vviad[1].offset = offsetof( struct vertex, normal ); Il 12 


vviad[2].location = 2; 

vviad[2].binding 7 0; 

vviad[2].format = VK FORMAT VEC3; Il r, g, b 
vviad[2].offset = offsetof( struct vertex, color ); I| 24 


vviad[3].location = 3; 

vviad[3].binding = 0; 

vviad[3].format = VK_FORMAT_VEC2; ШЕЛІ 
vviad[3].offset = offsetof( struct vertex, texCoord ); 
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// used to describe the input vertex attributes 
_VERTEX_INPUT_STATE_CREATE_INFO; 


VkPipelineVertexInputStateCreatelnfo С vpvisci > 
vpvisci.sType = УК STRUCTURE TYPE-RPIP q 
vpvisci.pNext = nullptr; 
vpvisci.flags = 0; T +t 
vpvisci.vertexBindingDescriptionCount = 1; Declare the binding descriptions and 
vpvisci.pVertexBindingDescriptions = vvibd; attribute descriptions 
vpvisci.vertexAttributeDescriptionCount = 4; 
vpvisci.pVertexAttributeDescriptions = vviad; 


VkPipelinelnputAssemblyStateCreatelnfo 
vpiasci.sType = УК STRUCTURE TYPE"PIP ЧЕ INPUT ASSEMBLY STATE CREATE INFO; 


vpiasci.pNext = nullptr; 

vpiasci.flags = 0; 

vpiasci.topology = VK PRIMITIVE TOPOLOGY TRIANGLE LIST;; 
#ifdef CHOICES 
VK PRIMITIVE TOPOLOGY POINT LIST 
VK PRIMITIVE TOPOLOGY LINE LIST Declare the vertex topology 
VK PRIMITIVE TOPOLOGY TRIANGLE LIST 
VK PRIMITIVE TOPOLOGY LINE STRIP 
VK PRIMITIVE TOPOLOGY TRIANGLE STRIP 
VK PRIMITIVE TOPOLOGY TRIANGLE FAN 
VK PRIMITIVE TOPOLOGY LINE LIST WITH ADJACENCY 
VK PRIMITIVE TOPOLOGY LINE STRIP WITH ADJACENCY 
VK PRIMITIVE TOPOLOGY TRIANGLE LIST WITH ADJACENCY 
VK PRIMITIVE TOPOLOGY TRIANGLE STRIP WITH ADJACENCY 
#endif 

vpiasci.primitiveRestartEnable = VK FALSE; 


VkPipelineTessellationStateCreatelnfo 
vptsci.sType = VK STRUCTURE TYPE PIiPEEHNETTESSELLATION STATE CREATE INFO; 
vptsci.pNext = nullptr; 
vptsci.flags = 0; Tessellation Shader info 
vptsci.patchControlPoints = 0; Il number of patch control points 

VkPipelineGeometryStateCreatelnfo 
vptsci.sType = VK STRUCTURE TYPE PIPEEHNECTESSELLATION STATE CREATE INFO; 


vptsci.pNext = nullptr; š 
dii =0; : Geometry Shader info 
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VK PRIMITIVE TOPOLOGY POINT LIST VK PRIMITIVE TOPOLOGY TRIANGLE LIST 
e у. 
@ v 
Vo @ V, Vo 


VK_PRIMITIVE_TOPOLOGY_TRIANGLE_STRIP 
VK_PRIMITIVE_TOPOLOGY_LINE_LIST = -= V - 2 


6 


VK_PRIMITIVE_TOPOLOGY_TRIANGLE_FAN 


V3 
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vpiasci.primitiveRestartEnable = VK_FALSE; 


“Restart Enable” is used with: 
* Indexed drawing. 
* Triangle Fan and *Strip topologies 


If vpiasci.primitiveRestartEnable is VK TRUE, then a special “index” indicates that the 
primitive should start over. This is more efficient than explicitly ending the current primitive 
and explicitly starting a new primitive of the same type. 


typedef enum VkIndexType 


VK INDEX TYPE UINT1670, /0- 65,535 
VK INDEX TYPE UINT32 7 1, // 0 — 4,294,967,295 
) VkIndexType; 


If your VkIndexType is VK INDEX TYPE UINT(16, then the special index is Oxffff. 
If your VkIndexType is УК INDEX TYPE UINT32, it is Oxffffffff. 
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One Really Good use of Restart Enable is in Drawing Terrain 127 
Surfaces with Triangle Strips 


Triangle Strip #0: 


Triangle stir #1: ТЫ ӘРЕ Е БАТУЫ 
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VkViewport 
w.x = 0; 
w.y = 0; 
vv.width = (float)Width; 
vv.height = (float)Height; 
vv.minDepth = 0.0f; 
vv.maxDepth = 1.0f; 


VkRect2D 
vr.offset.x = 0; 
vr.offset.y = 0; 
vr.extent.width = Width; 
vr.extent.height = Height; 


VkPipelineViewportStateCreatel nfo vpvsci; 
vpvsci.sType = VK_STRWCTURE_TYPE_PIPELINE_VIEWPORT_STATE_CREATE_INFO; 
vpvsci.pNext = nullptr; | 
vpvsci.flags = 0; 
vpvsci.viewportCount 
vpvsci.pViewports = &vv; 
vpvsci.scissorCount = 
vpvsci.pScissors = &vr; 
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What is the Difference Between Changing the Viewport and Changing the Scissoring? 129 


Viewport 

Viewporting operates on vertices and takes place 
right before the rasterizer. Changing the vertical part 
of the viewport causes the entire scene to get scaled 
(scrunched) into the viewport area. 


Original Image 


Scissoring: 

Scissoring operates on fragments and 
takes place right after the rasterizer. 
Changing the vertical part of the 
scissor causes the entire scene to get 
clipped where it falls outside the 
scissor area. 
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VkPipelineRasterizationStateCreatelnfo 
vprsci.sType = УК STRUCTURE TYPE PtP E RASTERIZATION STATE CREATE INFO; 


vprsci.pNext = nullptr; 
vprsci.flags = 0; Declare information about how the 
vprsci.depthClampEnable = VK FALSE; 
vprsci.rasterizerDiscardEnable = VK FALSE; 
vprsci.polygonMode = VK POLYGON MODE FILL; 

#ifdef CHOICES 

VK POLYGON MODE FILL 

VK POLYGON MODE LINE 

VK POLYGON MODE POINT 

#endif 


rasterization will take place 


vprsci.cullMode = VK_CULL_MODE_NONE; ІІ recommend this because of the projMatrix[1][1] *= - 
1: 
#ifdef CHOICES 
VK_CULL_MODE_NONE 
VK_CULL_MODE_FRONT_BIT 
VK_CULL_MODE_BACK_BIT 
VK_CULL_MODE_FRONT_AND_BACK_BIT 
#endif 


vprsci.frontFace = VK_FRONT_FACE_COUNTER_CLOCKWISE; 
#ifdef CHOICES 
VK_FRONT_FACE_COUNTER_CLOCKWISE 
VK_FRONT_FACE_CLOCKWISE 
#endif 

vprsci.depthBiasEnable = VK_FALSE; 

vprsci.depthBiasConstantFactor = O.f; 

vprsci.depthBiasClamp = O.f; 

vprsci.depthBiasSlopeFactor = 0.f; 

vprsci.lineWidth = 14; 
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What is “Depth Clamp Enable”? 131 


vprsci.depthClampEnable = VK_FALSE; 


Depth Clamp Enable causes the fragments that would normally have been 
discarded because they are closer to the viewer than the near clipping plane to 
instead get projected to the near clipping plane and displayed. 


А good use for this is Polygon Capping: 


The front of the polygon is clipped, The gray area shows what would 
revealing to the viewer that this is 
really a shell, not a solid 


happen with depthClampEnable 
(except it would have been red). 
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What is “Depth Bias Enable”? 


vprsci.depthBiasEnable = VK_FALSE; 
vprsci.depthBiasConstantFactor = 0.f; 
vprsci.depthBiasClamp = 0.f; 
vprsci.depthBiasSlopeFactor = 0.f; 


Depth Bias Enable allows scaling and translation of the Z-depth values as 
they come through the rasterizer to avoid Z-fighting. 


Z-fighting 
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VkPipelineMultisampleStateCreatelnfo Gomsei 
vpmsci.sType = VK_STRUCTURE_TYPE_PIPELINE_MULTISAMPLE_STATE_CREATE_INFO; 
vpmsci.pNext = nullptr; 
vpmsci.flags = 0; 
vpmsci.rasterizationSamples = VK SAMPLE COUNT 1 BIT; Declare information about how 
vpmsci.sampleShadingEnable = VK FALSE; the multisampling will take place 
vpmsci.minSampleShading = 0; 
vpmsci.pSampleMask = (VkSampleMask *)nullptr; 
vpmsci.alphaToCoverageEnable = VK FALSE; 
vpmsci.alphaToOneEnable = VK FALSE; 


We will discuss MultiSampling in a separate noteset. 
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Color Blending State for each Color Attachment * = 


Create an array with one of these for each color buffer attachment. Each 
color buffer attachment can use different blending operations. 


VkPipelineColorBlendAttachmentState 
vpcbas.blendEnable = VK_FALSE; 
vpcbas.srcColorBlendFactor = VK_BLEND_FACTOR_SRC_COLOR; 


vpcbas.colorBlendOp = VK BLEND OP ADD; 
qun re l BLEND FACTOEZ 


is controls blending between the output of 
ch color attachment and its image memory. 


VK COLCÓR COMPONENT А ВІТ; 


Colotnew —(1.-a) * Colorexisting + & * Colorincoming 
0.< а < 1. 


*A “Color Attachment” is a framebuffer to be rendered into. 
< SIGGRAPH m You can have as many of these as you want. acidi 


Raster Operations for each Color Attachment 


VkPipelineColorBlendStateCreatelnfo 


vpcbsci.sType = Cc D ETA 


vpcbsci.pNext = nullptr; 
vpcbsci.flags = 0; 
vpcbsci.logicOpEnable = VK_FALSE; 
vpcbsci.logicOp = VK_LOGIC_OP_COPY; 
#ifdef CHOICES 
VK_LOGIC_OP_CLEAR 
VK_LOGIC_OP_AND 
VK_LOGIC_OP_AND_REVERSE 
VK_LOGIC_OP_COPY 
VK_LOGIC_OP_AND_INVERTED 
VK_LOGIC_OP_NO_OP 
VK_LOGIC_OP_XOR 
VK_LOGIC_OP_OR 
VK_LOGIC_OP_NOR 
VK_LOGIC_OP_EQUIVALENT 
VK_LOGIC_OP_INVERT 
VK_LOGIC_OP_OR_REVERSE 
VK_LOGIC_OP_COPY_INVERTED 
VK_LOGIC_OP_OR_INVERTED 


VK_LOGIC_OP_NAND 
VK_LOGIC_OP_SET 
#endif 


vpcbsci.attachmentCount = 1; 
vpcbsci.pAttachments = &vpcbas; 
vpcbsci.blendConstants[0] = 0; 
vpcbsci.blendConstants[1] = 0; 
vpcbsci.blendConstants[2] = 0; 
vpcbsci.blendConstants[3] = 0; 


This controls blending between the 
output of the fragment shader and the 
input to the color attachments. 
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Just used as an example in the Sample Code 
VkDynamicState 


(vas <d VK DYNAMIC STATE VIEWPORT, VK_DYNAMIC_STATE_SCISSOR E> 
#ifdef CHOICES 


VK DYNAMIC STATE VIEWPOR -- vkCmdSetViewport( ) 
VK_DYNAMIC_STATE_SCISSOR -- vkCmdSetScissor( ) 


VK_DYNAMIC_STATE_LINE_WIDT -- vkCmdSetLineWidth( ) 

VK DYNAMIC STATE DEPTH ВІ -- vkCmdSetDepthBias( ) 

VK DYNAMIC STATE BLEND CONSTANTS -- vkCmdSetBendConstants( ) 

VK DYNAMIC STATE DEPTH BOWNDS -- vkCmdSetDepthZBounds( ) 

VK DYNAMIC. STATE STENCIL COMPARE MASK -- vkCmdSetStencilCompareMask( ) 
VK DYNAMIC STATE STENCIL WRITE MASK -- vkCmdSetStencilWriteMask( ) 
VK_DYNAMIC_STATE_STENCIL_ REFERENCE -- vkCmdSetStencilReferences( ) 


#endif 


VkPipelineDynamicStateCreatelhfo 
vpdsci.sType = VK_STRUCITURE_TYPE_PIPELTNE DYNAMIC STATE CREATE INFO; 


vpdsci.pNext = nullptr; 
vpdsci.flags = 0; 
vpdsci.dynamicStateCount $ 0; // leave turned off for now 
vpdsci.pDynamicStates = vds; 
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The Stencil Buffer 


Update 


Here’s how the Stencil Buffer works: 


1. While drawing into the Render Buffer, you can write values into the Stencil 
Buffer at the same time. 


2. While drawing into the Render Buffer, you can do arithmetic on values in the 
Stencil Buffer at the same time. 


3. When drawing into the Render Buffer, you can write-protect certain parts of the 
Render Buffer based on values that are in the Stencil Buffer 
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Using the Stencil Buffer to Create a Magic Lens 
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Using the Stencil Buffer to Create a Magic Lens id 


Clear the SB = 0 

Write protect the color buffer 

Fill a square, setting SB = 1 
Write-enable the color buffer 

Draw the solids wherever SB -- 

Draw the wireframes wherever SB == 1 


e e 
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Outlining Polygons the Naive Way 140 


SS 


1. Draw the polygons 
2. Draw the edges 


Z-fighting 
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Using the Stencil Buffer to Better Outline Polygons Lu 


{ 
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Using the Stencil Buffer to Better Outline Polygons е 


Clear the SB = 0 


for( each polygon ) 
{ 


Draw the edges, setting SB = 1 
Draw the polygon wherever SB != 1 
Draw the edges, setting SB = 0 


Before 
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Using the Stencil Buffer to Perform Hidden Line Removal 
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Stencil Operations for Front and Back Faces 144 


VkStencilOpState II front 


vsosf.depthFailOp = VK_STENCIL_OP=EEP; // what to do if depth operation fails 
vsosf.failOp = VK STENCIL OP KEEP; // what to do if stencil operation fails 
vsosf.passOp = VK STENCIL OP KEEP; // what to do if stencil operation succeeds 
#ifdef CHOICES 
VK_STENCIL_OP_KEEP -- keep the stencil value as it is 
VK_STENCIL_OP_ZERO -- set stencil value to 0 
VK_STENCIL_OP_REPLACE -- replace stencil value with the reference value 


VK STENCIL OP INCREMENT AND CLAMP  -- increment stencil value 
VK STENCIL OP DECREMENT AND CLAMP  --decrement stencil value 


VK STENCIL OP INVERT -- bit-invert stencil value 
VK STENCIL OP INCREMENT AND WRAP -- increment stencil value 
VK_STENCIL_OP_DECREMENT_AND_WRAP -- decrement stencil value 
#endif 

vsosf.compareOp = VK_COMPARE_OP_NEVER; 
#ifdef CHOICES 
VK_COMPARE_OP_NEVER -- never succeeds 
VK_COMPARE_OP_LESS -- succeeds if stencil value is < the reference value 
VK_COMPARE_OP_EQUAL -- succeeds if stencil value is == the reference value 
VK_COMPARE_OP_LESS_OR_EQUAL -- succeeds if stencil value is <= the reference value 
VK_COMPARE_OP_GREATER -- succeeds if stencil value is > the reference value 
VK_COMPARE_OP_NOT_EQUAL -- succeeds if stencil value is != the reference value 
VK_COMPARE_OP_GREATER_OR_EQUAL -- succeeds if stencil value is >= the reference value 
VK_COMPARE_OP_ALWAYS -- always succeeds 
#endif 


vsosf.compareMask = ~0; 
vsosf.writeMask = ~0; 
vsosf.reference = 0; 


VkStencilOpState Il back 
vsosb.depthFailOp = VK_STENCIL_ORe P; 


vsosb.failOp = VK STENCIL OP KEEP; 
vsosb.passOp - VK STENCIL OP KEEP; 
vsosb.compareOp = VK COMPARE OP NEVER; 
vsosb.compareMask = ~0; 

vsosb.writeMask = ~0; 


2 THINK vsosb.reference = 0; 
= шш “ы тр — July 24, 2020 


Operations for Depth Values 145 


VkPipelineDepthStencilStateCreatelnfo Cvpdssci; ) 
vpdssci.sType = VK STRUCTURE TYPE PIPEHNE-DEPTH STENCIL STATE CREATE INFO; 


vpdssci.pNext = nullptr; 

vpdssci.flags = 0; 

vpdssci.depthTestEnable = VK_TRUE; 

vpdssci.depthWriteEnable = VK_TRUE; 

vpdssci.depthCompareOp = VK_COMPARE_OP_LESS; 
VK_COMPARE_OP_NEVER -- never succeeds 
VK_COMPARE_OP_LESS -- succeeds if new depth value is < the existing value 
VK_COMPARE_OP_EQUAL -- succeeds if new depth value is == the existing value 
VK_COMPARE_OP_LESS_OR_EQUAL -- succeeds if new depth value is <= the existing value 
VK_COMPARE_OP_GREATER -- succeeds if new depth value is > the existing value 
VK_COMPARE_OP_NOT_EQUAL -- succeeds if new depth value is != the existing value 
VK_COMPARE_OP_GREATER_OR_EQUAL -- succeeds if new depth value is >= the existing value 
VK_COMPARE_OP_ALWAYS -- always succeeds 
#endif 

vpdssci.depthBoundsTestEnable = VK_FALSE; 

vpdssci.front = vsosf; 

vpdssci.back = vsosb; 

vpdssci.minDepthBounds = 0.; 

vpdssci.maxDepthBounds = 1.; 

vpdssci.stencilTestEnable = VK_FALSE; 
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VkPipeline GraphicsPipeline; 


VkGraphicsPipelineCreatelnfo «СТ» 
vgpci.sType = УК STRUCTURE TYPE GRARHICS PIPELINE CREATE INFO; 


vgpci.pNext = nullptr; 

vgpci.flags = 0; 
#ifdef CHOICES 
VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT 
VK PIPELINE CREATE ALLOW DERIVATIVES ВІТ 
VK PIPELINE CREATE DERIVATIVE BIT 
#endif 


Group all of the individual state 
information and create the pipeline 


vgpci.stageCount = 2; Il number of stages іп 
vgpci.pStages = vpssci; 

vgpci.pVertexInputState = &vpvisci; 
vgpci.plnputAssemblyState = &vpiasci; 
vgpci.pTessellationState = (VkPipelineTessellationStateCreatelnfo *) 
vgpci.pViewportState = &vpvsci; 

vgpci.pRasterizationState = &vprsci; 

vgpci.pMultisampleState = &vpmsci; 

vgpci.pDepthStencilState = &vpdssci; 

vgpci.pColorBlendState = &vpcbsci; 

vgpci.pDynamicState = &vpdsci; 

vgpci.layout = IN GraphicsPipelineLayout; 

vgpci.renderPass = IN RenderPass; 

vgpci.subpass = 0; // subpass number 
vgpci.basePipelineHandle = (VkPipeline) VK_NULL_HANDLE; 
vgpci.basePipelineIndex = 0; 


is pipeline 


При; 


result = vkCreateGraphicsPipelines( LogicalDevice, VK_ NULL. HANDLE, 
PALLOCATOR, OUT &Grap 
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Later on, we will Bind a Specific Graphics Pipeline Data Structure to 
the Command Buffer when Drawing 
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Sidebar: What is the Organization of the Pipeline Data Structure? 148 


If you take a close look at the pipeline data structure creation information, you will see that almost all 
the pieces have a fixed size. For example, the viewport only needs 6 pieces of information — ever: 
VkViewport vv; 

vV.x = 0; 

vv.y = 0; 

vv.width = (float)Width; 

vv.height = (float)Height; 

vv.minDepth = 0.0f; 

vv.maxDepth = 1.0f; 
There are two exceptions to this -- the Descriptor Sets and the Push Constants. Each of these two 
can be almost any size, depending on what you allocate for them. So, | think of the Pipeline Data 
Structure as consisting of some fixed-layout blocks and 2 variable-layout blocks, like this: 


Fixed-layout Pipeline Blocks Variable-layout 


Pipeline Blocks 


© 
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Vulkan. 


Descriptor Sets 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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In OpenGL 150 


OpenGL puts all uniform data in the same “set”, but with different binding numbers, so you 
can get at each one. 


Each uniform variable gets updated one-at-a-time. 


Wouldn't it be nice if we could update a collection of related uniform variables all at once, 
without having to update the uniform variables that are not related to this collection? 


layout( std140, binding = 0 ) uniform mat4 uModelMatrix; 
layout( std140, binding = 1 ) uniform mat4 uViewMatrix; 
layout( std140, binding = 2 ) uniform mat4 uProjectionMatrix; 
layout( std140, binding = 3 ) uniform mat3 uNormalMatrix; 
layout( std140, binding 7 4 ) uniform vec4 uLightPos; 
layout( std140, binding = 5 ) uniform float uTime; 

layout( std140, binding = 6 ) uniform int uMode; 

layout( binding = 7 ) uniform sampler2D uSampler; 
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What are Descriptor Sets? 


Descriptor Sets are an intermediate data structure that tells shaders how to connect 
information held in GPU memory to groups of related uniform variables and texture sampler 
declarations in shaders. There are three advantages in doing things this way: 


* Related uniform variables can be updated as a group, gaining efficiency. 


* Descriptor Sets are activated when the Command Buffer is filled. Different values for the 
uniform buffer variables can be toggled by just swapping out the Descriptor Set that 
points to GPU memory, rather than re-writing the GPU memory. 


* Values for the shaders' uniform buffer variables can be compartmentalized into what 
quantities change often and what change seldom (scene-level, model-level, draw-level), 
so that uniform variables need to be re-written no more often than is necessary. 

for( each scene ) 


{ 
Bind Descriptor Set #0 


for( each object ) 


Bind Descriptor Set #1 
for( each draw ) 


Bind Descriptor Set #2 
Do the drawing 
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Our example will assume the following shader uniform variables: 


Il non-opaque must be in a uniform block: 

layout( std140, set = 0, binding = 0 ) uniform matBuf 

{ 
mat4 uModelMatrix; 
mat4 uViewMatrix; 
mat4 uProjectionMatrix; 
mat3 uNormalMatrix; 

} Matrices; 


layout( std140, set = 1, binding = 0 ) uniform lightBuf 


{ 
vec4 uLightPos; 


} Light; 


layout( std140, set = 2, binding = 0 ) uniform miscBuf 
float uTime; 
int uMode; 

} Misc; 


layout( set = 3, binding = 0 ) uniform sampler2D uSampler; 
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GPU: 


Uniform data in a “Мор” 


GPU: 


Uniform data used 
in the shader 


CPU: 


Uniform data created in 


a C++ data structure 


° Knows the CPU data structure ° Knows where the data starts 

o ° Knows the shader data structure 
° Knows where the data starts ° Knows the data's size Doesn't know where eachipiece обов stare 
° Knows the data's size ° Doesn't know the CPU or GPU data structure P 


struct matBuf 


10111001010101111101000 
10000101110110101110100 


layout( std140, set = 0, binding = 0 ) uniform matBuf 


{ 11011001100000011101011 ( 

s 2; 11001110110100110010111 жағы 
glm::mat4 uModelMatrix; 11040141001401301010000 mat4 uModelMatrix; 
glm::mat4 uViewMatrix; 01001000111101000100101 mat4 uViewMatrix; 

: : : 01010100111111001000011 : : А 
glm::mat4 uProjectionMatrix; 10010101010011000110110 mat4 uProjectionMatrix; 
glm::mat3 uNormalMatrix; 18110111110111111111100 mat3 uNormalMatrix; 

А 01010000101110110 қ | 
5 000110100010100111 ICES; 


struct lightBuf 


01110101110100110001110 
10001010001111010111101 
10111010010010001101011 


layout( $19140, set = 1, binding = 0 ) uniform lightBuf 


{ 00000001111100011000010 { 
: 00001101100111010100011 : 
glm::vec4 uLightPos; 10100011001100110010000 vec4 uLightPos; 
y 11000011001111001001111 } Light; 
у 01001000100101100111000 , 
101110000001010000101 11 
struct miscBuf 1.1... layout( std140, set = 2, binding = 0 ) uniform miscBuf 
{ 11110111011111111011111 { 
A 10100111101111010111100 ii 
float uTime; 10101000000111100100110 float uTime; 
int uMode: 01110011111010010110011 int uMode: 
i 10110011100011011000111 . i 
10110000111110001110010 } Misc; 


€ SIGGRAPH THINK 


sone BEYOND 


* “binary large object” 


11001001110011010111011 
10010100 


layout( set = 3, binding = 0 ) uniform sampler2D uSampler; 
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Step 1: Descriptor Set Pools 


You don't allocate Descriptor Sets on the fly — that is too slow. 


Instead, you allocate a “pool” of Descriptor Sets and then pull from that pool later. 


flags maxSets poolSizeCount poolSizes 


Lo 


| VkbescriptorPooiCreatelnto | | VkbescriptorPooiCreatelnto | 


vkCreateDescriptorPool( ) 


DescriptorSetPool 


THINK 
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155 
VkResult 
Init13DescriptorSetPool( ) 


VkResult result; 


VkDescriptorPoolSize «СТІ» 


vdps[0].type = VK_DESCRIPTOR— NIFORM_BUFFER; 
vdps[0].descriptorCount = 1; 
vdps[1].type = VK_DESCRIPTOR_JYPE_UNIFORM_BUFFER; 
vdps[1].descriptorCount = 1; 
vdps[2].type = VK_DESCRIPTOR_JTYPE_UNIFORM_BUFFER; 
vdps[2].descriptorCount = 1; 
vdps[3].type = VK_DESCRIPTORJTYPE_COMBINED_IMAGE_SAMPLER; 
vdps[3].descriptorCount = 1; 


#ifdef CHOICES 
VK_DESCRIPTOR_TYPE_SAMPLER 
VK_DESCRIPTOR_TYPE_SAMPLED_IMAGE 
VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER 
VK_DESCRIPTOR_TYPE_STORAGE_INAGE 

VK DESCRIPTOR TYPE UNIFORM TÉXEL BUFFER 
VK DESCRIPTOR TYPE STORAGE ТЕХЕІ BUFFER 
VK DESCRIPTOR TYPE UNIFORM HUFFER 

VK DESCRIPTOR TYPE STORAGE BUFFER 

VK DESCRIPTOR TYPE UNIFORM BUFFER DYNAMIC 
VK DESCRIPTOR TYPE STORAGE |BUFFER DYNAMIC 


#endif 


VkDescriptorPoolCreatelnfo 
CRIPTOR POOL CREATE INFO; 
vdpci.pNext = nullptr; 
vdpci.flags = 0; 
vdpci.maxSets = 4; 
vdpci.poolSizeCount = 4; 
vdpci.pPoolSizes = &vdps[0]; 


result = vkCreateDescriptorPool( LogicalDevice, IN &vdpci, PALLOCATOR, OUT &DescriptorPool); 
return result; 


>, THINK 
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Step 2: Define the Descriptor Set Layouts 156 


I think of Descriptor Set Layouts as a kind of “Rosetta Stone" that allows the Graphics 
Pipeline data structure to allocate room for the uniform variables and to access them. 
y г ol "215277417! 


m= (C Ty Па 


layout( std140, set = 0, binding = 0 ) uniform matBuf 
{ 

mat4 uModelMatrix; 

mat4 uViewMatrix; 

mat4 uProjectionMatrix; 

mat3 uNormalMatrix; 
} Matrices; 


ayout( std140, set = 1, binding = 0 ) uniform lightBuf 


1 
vec4 uLightPos; 


) Light; 


layout( std140, set = 2, binding = 0 ) uniform miscBuf 


float uTime; 
int. uMode; 


https://www.britishmuseum.org — Misc: 
E? set = 3, binding = 0 ) uniform sampler2D uSampler; 


MatrixSet DS Layout Binding: ^ LightSet DS Layout Binding: MiscSet DS Layout Binding: TexSamplerSet DS Layout Binding: 


binding binding binding binding 
descriptorType descriptorType descriptorType descriptorType 


descriptorCount descriptorCount descriptorCount descriptorCount 
pipeline stage(s) pipeline stage(s) pipeline stage(s) pipeline stage(s) 


set- 0 set - 1 set = 2 set = З 


£2 SIGGRAPH HN 
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VkResult 
Init13DescriptorSetLayouts( ) 
{ 


VkResult result; 


IDS #0: 

VkDescriptorSetLayoutBinding MatrixSet[1]; 
MatrixSet[0].binding =0; 
MatrixSet[0].descriptorType = VK DESCRIPTOR TYPE UNIFORM BUFFER; 
MatrixSet[0].descriptorCount = 1; 
MatrixSet[0].stageFlags - VK SHADER STAGE VERTEX BIT; 
MatrixSet[0].plmmutableSamplers = (VkSampler *)nullptr; 


I| DS #1: 
VkDescriptorSetLayoutBinding LightSet[1]; 
LightSet[0].binding = 0; 
LightSet[0].descriptorType =VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER; 
LightSet[0].descriptorCount = 1; 
LightSet[0].stageFlags = VK SHADER STAGE VERTEX ВІТ | VK_SHADER_STAGE_FRAGMENT_BIT; 
LightSet[0].plmmutableSamplers = (VkSampler *)nullptr; 


10$ #2: 
VkDescriptorSetLayoutBinding MiscSet[1]; 
MiscSet[O].binding = 0; 
MiscSet[0].descriptorType = VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER; 
MiscSet[0].descriptorCount = 1; 
MiscSet[0].stageFlags = VK SHADER STAGE VERTEX ВІТ | VK_SHADER_STAGE_FRAGMENT_BIT; 
MiscSet[0].plmmutableSamplers = (VkSampler *)nullptr; 


II DS #3: 

VkDescriptorSetLayoutBinding TexSamplerSet[1]; 

TexSamplerSet[0].binding = 0; 

TexSamplerSet[0].descriptorType =VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER; 
TexSamplerSet[0].descriptorCount = 1; - 
TexSamplerSet[0].stageFlags = VK SHADER STAGE FRAGMENT BIT; uniform sampler2D uSampler; 
TexSamplerSet[0].plmmutableSamplers = (VkSampler *)nullptr; vec4 rgba = texture( uSampler, vST ); 


157 
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Step 2: Define the Descriptor Set Layouts 158 


MatrixSet DS Layout Binding: LightSet DS Layout Binding: — MiscSet DS Layout Binding: TexSamplerSet DS Layout Binding: 


binding binding binding binding 
descriptorType descriptorType descriptorType descriptorType 
descriptorCount descriptorCount descriptorCount descriptorCount 
pipeline stage(s) pipeline stage(s) pipeline stage(s) pipeline stage(s) 
set - 0 set - 1 set = 2 set = З 
vdslc0 DS Layout СІ: vdsic1 DS Layout СІ: vdslc2 DS Layout СІ: vdslc3 DS Layout СІ: 


bindingCount bindingCount bindingCount bindingCount 


type type type type 
number of that type number of that type number of that type number of that type 


pipeline stage(s) pipeline stage(s) pipelingstage(s) pipeline stage(s) 


Array of Descriptor 
Set Layouts 


Pipeline Layout 


H THINK 
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159 


È 


VkDescriptorSetLayoutCreatelnfo < 
vdslc0.sType = VK STRUCTURE TYPE-DESCRIPTOR SET LAYOUT CREATE INFO; 


vdslcO.pNext = nullptr; 

vdsic0.flags = 0; 
vdslcO.bindingCount = 1; 
vdslcO.pBindings = &MatrixSet[0]; 


VkDescriptorSetLayoutCreatelnfo < 
vdslc1.sType = VK_STRUCTURE_TYPE~DESCRIPTOR_SET_LAYOUT_CREATE_INFO; 


vdslc1.pNext = nullptr; 

vdslc1 flags = 0; 
vdslc1.bindingCount = 1; 
vdslc1.pBindings = &LightSet[0]; 


È 


VkDescriptorSetLayoutCreatelnfo 
vdslc2.sType = VK STRUCTURE TYPE-B RIPTOR SET LAYOUT CREATE INFO; 
vdslc2.pNext = пиріг; 
vdslc2.flags = 0; 
vdslc2.bindingCount = 1; 
vdslc2.pBindings = &MiscSet[0]; 


L 


VkDescriptorSetLayoutCreatelnfo 
vdsic3.sType = VK STRUCTURE TYPE-BPESCRIPTOR SET LAYOUT CREATE INFO; 
vdslc3.pNext = nullptr; 
vdslc3.flags = 0; 
vdslc3.bindingCount = 1; 
vdslc3.pBindings = &TexSamplerSet[0]; 


Ë 


result = vkCreateDescriptorSetLayout( LogicalDevice PALLOCATOR, OUT &DescriptorSetLayouts[0] ); 
result = vkCreateDescriptorSetLayout( LogicalDevice/ IN &vdslc1, PALLOCATOR, OUT &DescriptorSetLayouts[1] ); 
) 
) 


result = vkCreateDescriptorSetLayout( LogicalDevicd, IN &vdslc2, PALLOCATOR, OUT &DescriptorSetLayouts[2] 
result = vkCreateDescriptorSetLayout( LogicalDeviceNN &vdslc3f PALLOCATOR, OUT &DescriptorSetLayouts[3] 


return result; 


m 
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Step 3: Include the Descriptor Set Layouts in a Graphics Pipeline Layout 160 


VkResult 
Init14GraphicsPipelineLayout( ) 


{ 
VkResult result; 


VkPipelineLayoutCreatelnfo С vplci;> 
vplci.s Type = УК STRUCTURE TYPE CIPELINE LAYOUT CREATE INFO; 


vplci.pNext = nullptr; 
vplci.flags = 0; 
vpici. n = 4; 


vplci. pushConstan 


4 š 
vplci.pPushConstantRanges = (VkPushConstant ange *)nullptr; 


result = vkCreatePipelineLayout( LogicalDevice, IN &vplci, PALLOCATOR, OUT &GraphicsPipelineLayout ); 


return result; 


THINK 
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Step 4: Allocating the Memory for Descriptor Sets 161 


DescriptorSetPool 
descriptorSetCount DescriptorSetLayouts 


VkDescriptorSetAllocatelnfo 


vkAllocateDescriptorSets( ) 


Descriptor Set 


| THINK 
£2 SIGGRAPH HN 


mjb — July 24, 2020 


Step 4: Allocating the Memory for Descriptor Sets 


VkResult 
Init3DescriptorSets( ) 


VkResult result; 


VkDescriptorSetAllocateInfo «ОЭ, 
vdsai.sType = УК 5ТКОСТОКЕ_ТҮРК DESCRIPTOR SET ALLOCATE INFO; 


vdsai.pNext = nullptr; 

vdsai.descriptorPool = DescriptorPool; 
vdsai.descriptorSetCount = 4; 
vdsai.pSetLayouts = DescriptorSetLayouts; 


result = vkAllocateDescriptorSets( LogicalDevice, IN &vdsai, OUT &DescriptorSets[0] ); 


- m 
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Step 5: Tell the Descriptor Sets where their CPU Data is 163 


This struct identifies what buffer it 


VkDescriptorBufferlnfo vdbi0; | 
owns and how big it is 


vdbi0. buffer = MyMatrixUniformBuffer. buffer; 
vdbi0.offset = 0; 
vdbiO.range = sizeof(Matrices); 


VkDescriptorBufferlnfo vdbi1; 
vdbi1.buffer = MyLightUniformBuffer. buffer; 
vdbi1.offset = 0; 
vdbi1.range = sizeof(Light); 


This struct identifies what buffer it 


owns and how big it is 


VkDescriptorBufferlnfo vdbi2; 
vdbi2.buffer = MyMiscUniformBuffer.buffer; 
vdbi2.offset = 0; 
vdbi2.range = sizeof(Misc); 


This struct identifies what buffer it 


owns and how big it is 


This struct identifies what texture 


VkDescriptorlmagelnfo vdii0; sampler and image view it owns 


vdii.sampler = MyPuppyTexture.texSampler; 
vdii.imageView = MyPuppyTexture.texlmageView; 
vdii.imageLayout = VK_IMAGE_LAYOUT_SHADER_READ_ONLY_OPTIMAL; 


ТІ! 
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Step 5: Tell the Descriptor Sets where their CPU Data is 164 


This struct links a Descriptor Set to the 
VkWriteDescriptorSet vwds0; buffer it is pointing to 
Il ds 0: 


vwds0.sType = VK_STRUCTURE_TYPE_WRITE_DESCRIPTOR_SET; 
vwds0.pNext = nullptr; 

vwds0.dstSet = DescriptorSets[0]; 

vwds0.dstBinding = 0; 

vwds0.dstArrayElement = 0; 

vwds0.descriptorCount = 1; 

vwds0.descriptorType = VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER; 
vwds0.pBufferlnfo = IN &vdbi0; 

vwdsO.plmagelnfo = (VkDescriptorlmagelnfo *)nullptr; 
vwds0.pTexelBufferView = (VkBufferView *)nullptr; 


idet: This struct links a Descriptor Set to 
s 1: i 2124: 
VkWriteDescriptorSet vwds1; the buffer it is pointing to 


vwds1.sType = УК STRUCTURE TYPE WRITE DESCRIPTOR SET; 
vwds1.pNext = nullptr; 

vwds1.dstSet = DescriptorSets[1]; 

vwds1.dstBinding 7 0; 

vwds1.dstArrayElement - 0; 

vwds1.descriptorCount = 1; 

vwds1.descriptorType = VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER; 
vwds1.pBufferlnfo = IN &vdbi1; 

vwds1.plmagelnfo = (VkDescriptorlmagelnfo *)nullptr; 

THINK vwds1.pTexelBufferView = (VkBufferView *)nullptr; 
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Step 5: Tell the Descriptor Sets where their data is 165 


This struct links a Descriptor Set to 


the buffer it is pointing to 


VkWriteDescriptorSet vwds2; 
Il ds 2: 
vwds2.sType = УК STRUCTURE TYPE WRITE DESCRIPTOR SET; 
vwds2.pNext - nullptr; 
vwds2.dstSet = DescriptorSets[2]; 
vwds2.dstBinding = 0; 
vwds2.dstArrayElement = 0; 
vwds2.descriptorCount = 1; 
vwds2.descriptorType = VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER; 
vwds2.pBufferlnfo = IN &vdbi2; 
vwds2.plmagelnfo = (VkDescriptorlmagelnfo *)nullptr; 
vwds2.pTexelBufferView = (VkBufferView *)nullptr; 


This struct links a Descriptor Set to 

Il ds 3: the image it is pointing to 
VkWriteDescriptorSet vwds3; 

vwds3.sType = ҮК STRUCTURE TYPE WRITE DESCRIPTOR SET; 

vwds3.pNext = nullptr; 

vwds3.dstSet - DescriptorSets[3]; 

vwds3.dstBinding = 0; 

vwds3.dstArrayElement = 0; 

vwds3.descriptorCount = 1; 

vwds3.descriptorType = VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER; 

vwds3.pBufferlnfo = (VkDescriptorBufferlnfo *)nullptr; 

vwds3.plmagelnfo = IN &vdii0; 

vwds3.pTexelBufferView = (VkBufferView *)nullptr; 


uint32_t copyCount = 0; 
Il this could have been done with one call and an array of VkWriteDescriptorSets: 


vkUpdateDescriptorSets( LogicalDevice, 1, IN &vwds0, IN copyCount, (VkCopyDescriptorSet 
vkUpdateDescriptorSets( LogicalDevice, 1, IN &vwds1, IN copyCount, (VkCopyDescriptorSet 
vkUpdateDescriptorSets( LogicalDevice, 1, IN &vwds2, IN copyCount, (VkCopyDescriptorSet 
Ca SIGGR APH THINK vkUpdateDescriptorSets( LogicalDevice, 1, IN &vwds3, IN copyCount, (VkCopyDescriptorSet 


nullptr 


) 
nullptr ); 
) 
) 


nullptr 
nullptr 


ses BEYOND =ч тр — July 24, 2020 


*) 
*) 
*) 
*) . 


Step 6: Include the Descriptor Set Layout when Creating a Graphics Pipeline 166 


VkGraphicsPipelineCreatelnfo 
vgpci.sType = УК STRUCTURE ТҮР RAPHICS PIPELINE CREATE INFO; 
vgpci.pNext = nullptr; 
vgpci.flags = 0; 

#ifdef CHOICES 

VK PIPELINE CREATE DISABLE OPTIMIZATION BIT 

VK PIPELINE CREATE ALLOW DERIVATIVES BIT 

VK PIPELINE CREATE DERIVATIVE BIT 

#endif 
vgpci.stageCount = 2; Il number of stag&s in this pipeline 
vgpci.pStages - vpssci; 
vgpci.pVertexInputState = &vpvisci; 
vgpci.pInputAssemblyState = &vpiasci; 
vgpci.pTessellationState = (VkPipelineTessellationStateCreateWnfo *)nullptr; 
vgpci.pViewportState = &vpvsci; 
vgpci.pRasterizationState = &vprsci; 
vgpci.pMultisampleState = &vpmsci; 
vgpci.pDepthStencilState = &vpdssci; 
vgpci.pColorBlendState = &vpcbsci; 
vgpci.pDynamicState = 8 od 
vgpci.layout «TN GraphicsPipelineLayout> 
vgpci.renderPass = IN RenderPass; 
vgpci.subpass = 0; ІІ subpass number 
vgpci.basePipelineHandle = (VkPipeline) VK_NULL_HANDLE; 
vgpci.basePipelinelndex = 0; 


result = vkCreateGraphicsPipelines( LogicalDevice, V& NULL HANDLE, 1, IN &vgpci, PALLOCATOR, OUT &GraphicsPipeline ); 
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Step 7: Bind Descriptor Sets into the Command Buffer when Drawing vr 


| Descriptor Set 


| pipelineBindPoint | LIT descriptorSetCount 


| | vkCmdBindDescriptorSets() | / 


vkCmdBindDescriptorSets( CommandBuffers[nextImagelndex], 
VK_PIPELINE_BIND_POINT_GRAPHICS, GraphicsPipelineLayout, 
0, 4, DescriptorSets, 0, (uint32_t *)nullptr ); 


So, the Pipeline Layout contains the structure of the Descriptor Sets. 


Any collection of Descriptor Sets that match that structure can be bound into that pipeline. 


2 THINK 
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Sidebar: The Entire Collection of Descriptor Set Paths 168 


VkDescriptorPoolCreatelnfo 
vkCreateDescriptorPool( ) 


VkDescriptorSetLayoutBinding 
VkDescriptorSetLayoutCreatelnfo 


Create the pool of Descriptor 
Sets for future use 


Describe a particular Descriptor 
Set layout and use it in a specific 
Pipeline layout 


vkCreateDescriptorSetLayout( ) 
vkCreatePipelineLayout( ) 


VkDescriptorSetAllocatelnfo Allocate memory for particular 


vkAllocateDescriptorSets( ) Descriptor Sets 


VkDescriptorBufferlnfo Tell a particular 
| Descriptor Set where 
VkDescriptorlmagelnfo its CPU data is Re-write CPU data into a 
VkWriteDescriptorSet particular Descriptor Set 
vkUpdateDescriptorSets( ) 


ake a particular Descriptor Set 


M 
vkCmdBindDescriptorSets( ) nu for rendering 
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Sidebar: Why Do Descriptor Sets Need to Provide Layout Information 169 
to the Pipeline Data Structure? 


The pieces of the Pipeline Data Structure are fixed in size — with the exception of the 
Descriptor Sets and the Push Constants. Each of these two can be any size, depending on 
what you allocate for them. So, the Pipeline Data Structure needs to know how these two 
are configured before it can set its own total layout. 


Think of the DS layout as being a particular-sized hole in the Pipeline Data Structure. Any 
data you have that matches this hole’s shape and size can be plugged in there. 


The Pipeline Data Structure 


Fixed Pipeline Elements Specific.Descriptor 


Set Layout 


= THINK pra É 
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Sidebar: Why Do Descriptor Sets Need to Provide Layout Information 170 
to the Pipeline Data Structure? 


Any set of data that matches the Descriptor Set Layout can be plugged in there. 


и 
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Vulkan. 


Textures 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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Triangles in an Array of Structures 172 


struct vertex 


glm::vec3 position; 
glm::vec3 normal; 
gim::ve i 


glm::vec2 texCoord; 
}; 


struct vertex VertexData[ ] = 


II triangle 0-2-3: 
ІІ vertex #0: 


THINK 
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Memory Types 173 


CPU Memory GPU Memory 


Host Device 
Visible Local 


GPU Memory GPU Memory 


memcpy( ) vkCmdCopylmage( ) 
Texture 
Sampling Hardware 


RGBA to the Shader 
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Memory Types 174 


NVIDIA Discrete Graphics: 


11 Memory Types: 

Memory 0: 

Memory 

Memory 

Memory 

Memory 

Memory 

Memory 6: 

Memory 7: DeviceLocal 

Memory 8: DeviceLocal 

Memory 9: HostVisible HostCoherent 
: HostVisible HostCoherent HostCached 


Intel Integrated Graphics: 


3 Memory Types: 

Memory 0: DeviceLocal 

Memory 1: DeviceLocal HostVisible HostCoherent 

Memory 2: DeviceLocal HostVisible HostCoherent HostCached 


THINK 
£2 SIGGRAPH HI Pere 


Texture Sampling Parameters 175 


glTexParameteri( GL TEXTURE 2D, GL TEXTURE WRAP S, GL КЕ 
glTexParameteri( GL TEXTURE 2D, GL TEXTURE WRAP T, GL REPEAT ); 
glTexParameteri( GL TEXTURE 2D, GL TEXTURE MAG FILTER, GL LINEAR ); 
glTexParameteri( GL TEXTURE 2D, GL TEXTURE MIN FILTER, GL LINEAR ); 


Vulkan 


VkSamplerCreatelnfo 
vsci.magFilter = МК FILTER LIN 
vsci.minFilter = VK FILTER LINEAR 
vsci.mipmapMode = VK SAMPLER MIPMAP MODE LINEAR; 
vsci.addressModeU = VK SAMPLER ADDRESS MODE REPEAT; 
vsci.addressModeV = VK ЅАМРІЕҢ ADDRESS MODE REPEAT; 
vsci.addressModeW = VK SAMPLER| ADDRESS MODE REPEAT; 


result = vkCreateSampler( LogicalDevice, IN &vsci, PALLOCATOR, pTextureSampler ); 


72 SIGGRAPH THINK 
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Textures’ Undersampling Artifacts 176 


As an object gets farther away and covers a smaller and smaller part of the screen, the texels 
: pixels ratio used in the coverage becomes larger and larger. This means that there are 
pieces of the texture leftover in between the pixels that are being drawn into, so that some of 
the texture image is not being taken into account in the final image. This means that the 
texture is being undersampled and could end up producing artifacts in the rendered image. 


Pixels 


Consider a texture that consists of one red texel and all the rest white. It is easy to imagine an 
object rendered with that texture as ending up all white, with the red texel having never been 
included in the final image. The solution is to create lower-resolutions of the same texture so 
that the red texel gets included somehow in all resolution-level textures. 


72 SIGGRAPH THINK 
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Texture Mip*-mapping 


Average 4 pixels to 
make a new one 


RGBA, RGBA, RGBA, күм dM ыш 
RGBA, RGBA, RGBA, RGBA, RGBA, 
‚ RGBA,RGBA, RGBA, КОВА, 
, RGBA, RGBA, RGBA, RGBA, 


j RGBA, RGBA, RGBA, RGBA, Average 4 pixels to 
, RGBA,RGBA, RGBA, RGBA, make a new one 


, RGBA, RGBA, RGBA, RGBA, 
, RGBA, RGBA, RGBA, RGBA, 
, RGBA, RGBA, RGBA, RGBA, 


e Total texture storage is ~ 2x what it was without mip-mapping 
e Graphics hardware determines which level to use based on the texels : pixels ratio. 


° In addition to just picking one mip-map level, the rendering system can sample from 
two of them, one less that the T:P ratio and one more, and then blend the two RGBAs 
returned. This is known as VK SAMPLER MIPMAP MODE LINEAR. 


* Latin: multim in parvo, "many things in a small place" 
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VkResult 


VkResult result; 


VkSamplerCreatelnfo 


Init07TextureSampler( MyTexture * pMyTexture ) 
( 

vsci.sType = УК STRUCTURE ТҮ 
vsci.pNext = nullptr; 


vsci.flags 7 0; 


vsci.magFilter = VK FILTER LINEAR; 
vsci.minFilter = VK FILTER LINEAR; 
vsci.mipmapMode = VK SAMPLER МІРМ 
vsci.addressModeU = VK SAMPLER ADD 
vsci.addressModeV = VK SAMPLER ADD 
vsci.addressModeW = VK SAMPLER ADD 
#ifdef CHOICES 
VK SAMPLER ADDRESS MODE REPEAT 
VK SAMPLER ADDRESS MODE MIRRORED РЕВ 
VK SAMPLER ADDRESS MODE CLAMP TO EDG 
VK SAMPLER ADDRESS MODE CLAMP TO BORDER 
VK SAMPLER ADDRESS MODE MIRROR СІАМР TO EDGE 
#endif 


R_CREATE_INFO; 


P_MODE_LINEAR; 
RESS_MODE_REPEAT; 
ESS_MODE_REPEAT; 
ESS_MODE_REPEAT; 


AT 


vsci.mipLodBias = 0.; 
vsci.anisotropyEnable = VK_FALSE; 
vsci.maxAnisotropy = 1.; 
vsci.compareEnable = VK_FALSE; 
vsci.compareOp = VK COMPARE OP NEVER; 

#ifdef CHOICES 

VK COMPARE OP NEVER 

VK COMPARE OP LESS 

VK COMPARE OP EQUAL 

VK COMPARE OP LESS OR EQUAL 

VK COMPARE OP GREATER 

VK COMPARE OP NOT EQUAL 

VK COMPARE OP GREATER OR EQUAL 

VK COMPARE OP ALWAYS 

#endif 


vsci.minLod = 0.; 

vsci.maxLod = 0.; 

vsci.borderColor = VK_BORDER_COLOR_FROAT_OPAQUE_BLACK; 
#ifdef CHOICES 
VK_BORDER_COLOR_FLOAT_TRANSPARENT_BLAPK 
VK_BORDER_COLOR_INT_TRANSPARENT_BLACK 
VK_BORDER_COLOR_FLOAT_OPAQUE_BLACK 
VK_BORDER_COLOR_INT_OPAQUE_BLACK 
VK_BORDER_COLOR_FLOAT_OPAQUE_WHITE 
VK_BORDER_COLOR_INT_OPAQUE_WHITE 
#endif 


vsci.unnormalizedCoordinates = VK_FALSE; 
II VK FALSE 


Il VK TRUE means we are use raw texels as the index 
leans we are using the usual 0. - 1. 


result = vkCreateSampler( LogicalDevice, IN &vsci, PALLOCATOR, OUT &pMyTexture->texSampler ); 
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VkResult 
Init07TextureBuffer( INOUT MyTexture * pMyTexture) 


{ 
VkResult result; 


uint32_t texWidth = pMyTexture->width;; 

uint32 t texHeight = pMyTexture->height; 

unsigned char *texture = pMyTexture->pixels; 

VkDeviceSize textureSize = texWidth * texHeight * 4; Il rgba, 1 byte each 


Vklmage staginglmage; 
Vklmage texturelmage; 


ыны 


Il this first {...} is to create the staging image: 


[|| "ЖЖ ЖКХ 


VklmageCreatelnfo 
vici.sType = VK_STRUCTSR PE_IMAGE_CREATE_INFO; 


vici.pNext = nullptr; 
vici.flags = 0; 
vici.imageType = VK_IMAGE_TYPE_2D; 
vici.format = VK_FORMAT_R8G8B8A8_UNORM; 
vici.extent.width = texWidth; 
vici.extent.height = texHeight; 
vici.extent.depth = 1; 
vici.mipLevels = 1; 
vici.arrayLayers = 1; 
vici.samples = VK_SAMPLE_COUNT_1_ BIT; 
vici.tiling = VK_IMAGE_TILING_LINEAR; 
#ifdef| CHOICES 
VK_IMAGE_TILING_OPTIMAL 
VK_IMAGE_TILING_LINEAR 
#endi 


vici.usage = VK_IMAGE_USAGE_TRANSFER_SRC_BIT; 
#ifdef| CHOICES 
VK IMAGE USAGE TRANSFER SRC BIT 
VK IMAGE USAGE TRANSFER DST ВІТ 
VK IMAGE USAGE SAMPLED BIT 
VK IMAGE USAGE STORAGE ВІТ 
VK IMAGE USAGE COLOR ATTACHMENT BIT 
VK IMAGE USAGE DEPTH STENCIL ATTACHMENT BIT 
VK IMAGE USAGE TRANSIENT ATTACHMENT ВІТ 
VK АСЕ USAGE INPUT ATTACHMENT ВІТ 
#endi 


e SIGGRAPH THINK vici.sharingMode = VK_SHARING_MODE_EXCLUSIVE; 
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#ifdef[CHOICES 1 80 
VK_INAGE_LAYOUT_UNDEFINED 
VK_INAGE_LAYOUT_PREINITIALIZED 
#епаї 
vici.queueFamilyIndexCount = 0; 
vici.pQueueFamilyIndices = (const uint3P t *)nullptr; 


result = vkCreatelmage(LogicalDevice, IN &vici, PALLOCATOR, OUT &staginglmage); // allocated, but not filled 


VkMemoryRequirements vmr; 
vkGetlmageMemoryRequirements( LogicalDevice, IN staginglmage, OUT &vmr); 


if (Verbose) 
{ 
fprintf(FpDebug, "Image vmr.size = %lld\n", vmr.size); 
fprintf(FpDebug, "Image vmr.alignment = “%lld\n", vmr.alignment); 
fprintf(FpDebug, "Image vmr.memoryTypeBits = 0x%08x\n", vmr.memoryTypeBits); 
fflush(FpDebug); 
} 


VkMemoryAllocatelnfo 
vmai.sType = VK_STRUC 
vmai.pNext = nullptr; 
vmai.allocationSize = vmr.size; 
vmai.memoryTypelndex = FindMemory 


:_MEMORY_ALLOCATE_INFO; 


atlsHostVisible(); // because we want to ттар it 


VkDeviceMemory 
result = vkAllocateMemory( L 
pMyTexture->vdm = vdm; 


Весе, ai, PALLOCATOR, OUT &vdm); 


result = vkBindlmageMemory( LogicalDevice, IN staginglmage, ІМ уат, 0); // 0 = offset 
Il we have now created the staging image - fill it with the pixel data: 


VkImageSubresource 
vis.aspectMask = VK ІМА ASBPCT 


vis.mipLevel = 0; E Е 
vis.arrayLayer = 0; 


OR ВІТ; 


VkSubresourceLayout vsl; 
vkGetlmageSubresourceLayout( LogicalDevice, staginglmage, IN &vis, OUT &vsl); 


if (Verbose) 

{ 
fprintf(FpDebug, "Subresource Layout:\n"); 
fprintf(FpDebug, "\toffset = %lld\n", vsl.offset); 
fprintf(FpDebug, "\tsize = %lld\n", vsl.size); 
fprintf(FpDebug, "\trowPitch = %lld\n", vsl.rowPitch); 
fprintf(FpDebug, "\tarrayPitch = %lld\n", vsl.arrayPitch); 
fprintf(FpDebug, "\tdepthPitch = %lld\n", vsl.depthPitch); 
fflush(FpDebug); 


22 SIGGRAPH 1н | 


„azs BEYOND mjb — July 24, 2020 


181 


void * gpuMemory; 
vkMapMemory( LogicalDevice, vdm, 0, VK_WHOLE_ SIZE, 0, OUT &gpuMemory); 
Il 0 and 0 = offset and memory map flags 


if (vsl.rowPitch == 4 * texWidth) 


{ 
memcpy(gpuMemory, (void *)texture, (size t)textureSize); 
else 
( 
unsigned char *gpuBytes = (unsigned char *)gpuMemory; 
for (unsigned int y = 0; y « texHeight; y++) 
memcpy(&gpuBytes[y * vsl.rowPitch], &texture[4 * y * texWidth], (size t)(4*texWidth) ); 
} 
} 


vkUnmapMemory( LogicalDevice, vdm); 


Паны 
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"Ым ыы ыы ыы ыы 


// this second (...) is to create the actual texture image: 


ПЫ ыы ы ыны 


VklmageCreatelnfo «э. 


vici.sType = VK_STRUCTÙ ТУРЕ IMAGE CREATE INFO; 

vici.pNext = nullptr; 

vici.flags = 0; 

vici.imageType = VK_IMAGE_TYPE_2D; 

vici.format = VK FORMAT R8G8B8A8 UNORM; 

vici.extent.width = texWidth; 

vici.extent.height = texHeight; 

vici.extent.depth = 1; 

vici.mipLevels = 1; 

vici.arrayLayers = 1; 

vici.samples = VK_SAMPLE_COU 1_BIT; 

vici.tiling = VK IMAGE TILING OPTIMAL; 

vici.usage = УК IMAGE USAGE TRANSFER DST ВІТ | УК IMAGE USAGE SAMPLED ВІТ; 
ІІ because we are transferring into it and will eventual sample from it 

vici.sharingMode = V&K SHARING MQDE EXCLUSIVE; 

vici.initialLayout = МК IMAGE LAYOUT PREINITIALIZED; 

vici.queueFamilyIndexCount = 0; 

vici.pQueueFamilyIndices = (const uint$2 t *)nullptr; 


result = vkCreatelmage(LogicalDevice, IN &vici, PALLOCATOR, OUT &texturelmage); // allocated, but not filled 


VkMemoryRequirements vmr; 
vkGetlmageMemoryRequirements( LogicalDevice, IN texturelmage, OUT &vmr); 


if( Verbose ) 


fprintf( FpDebug, "Texture vmr.size = %lld\n", vmr.size ); 
fprintf( FpDebug, "Texture vmr.alignment = %lld\n", vmr.alignment ); 
fprintf( FpDebug, "Texture vmr.memoryTypeBits = 0x%08x\n", vmr.memoryTypeBits ); 
fflush( FpDebug ); 
} 
VkMemoryAllocatelnfo «э. 
vmai.sType = VK_STRUCTORE=TYPE_MEMORY_ALLOCATE_INFO; 
vmai.pNext = nullptr; 


vmai.allocationSize = vmr.size; 
vmai.memoryTypelndex = FindMemo 


VkDeviceMemory «Ээ 
result = vkAllocateMemory( 5уеаШейсе, | 


result = vkBindlmageMemory( LogicalDevice, IN texturelmage, ІМ vdm, 0 ); ПО = offset 


ThatlsDeviceLocal( ); // because we want to sample from it 


LPALLOCATOR, OUT &vdm); 


JJ eicit ыы ыы ы ыы ыы 
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ІІ copy pixels from the staging image to the texture: 


VkCommandBufferBeginInfo (Сазы?) 
vcbbi.sType -ҮК ЭТКОСТОКЕ ШЫР 
vcbbi.pNext = nullptr; 


vcbbi.flags = УК COMMAND BUFFER USAGPwQNE TIME SUBMIT ВІТ; 
vcbbi.pInheritancelnfo = (VKCommandBufferlnheritartegInfo *)nullptr; 


OMMAND BUFFER BEGIN INFO; 


result = vkBeginCommandBuffer( TextureCommandBuffer, IN &vcbbi); 


"Ым ыы dee ke dee ke dee ke deee ie 


// transition the staging buffer layout: 


"Ым kedede kedede kede de kede deske e desk Ae de ke e de ke e de ke e e k de ke k de ke k de k Ae de k Ae de k dee ke deke dee ke deekie 


VklmageSubresourceRange СЭ 


visr.aspectMask = VK_IMAGE_A Я 
visr.baseMipLevel = 0; 
visr.levelCount = 1; 
visr.baseArrayLayer = 0; 
visr.layerCount = 1; 


COLOR_BIT; 


VkImageMemoryBarrier 
vimb.sType = VK_STRUCT 
vimb.pNext = nullptr; 


GE _LAYOUT_TRANSFER_SRC_OPTIMAL; 
vimb.srcQueueFamilyIndgx = VK QUEUE FAMILY IGNORED; 
vimb.dstQueueFamilyIng K_QUEUE_FAMILY_IGNORED; 


STAGE_HOST_BIT, VK_PIPELINE_STAGE_HOST_BIT, 0, 
Barrier *)nullptr, 

0, (VkBuffa#MemoryBarrier *)nullptr, 

1, IN &vimb ); 


Пон 
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// transition the texture buffer layout: 


"Ын ыны ыы ыы ыы ыы ыы 


VklmageSubresourceRange «o 
visr.aspectMask = VK IMAGE *^8PEe 
visr.baseMipLevel = 0; 
visr.levelCount = 1; 
visr.baseArrayLayer = 0; 
visr.layerCount = 1; 


COLOR_BIT; 


VkImageMemoryBarrier 
vimb.sType = VK_STRUCT 
vimb.pNext = nullptr; 


vimb.srcQueueFamilylydex = /К QUEUE FAMILY IGNORED; 
vimb.dstQueueFamil K_QUEUE_FAMILY_IGNORED; 
vimb.image = texture 
vimb.srcAccessMagk = 0; 

Ask = VK WCCESS TRANSFER WRITE BIT; 
Range - visr; 


VK PIPELIME STAGE TOP OF PIPE BIT, VK PIPELINE STAGE TRANSFER BIT, 0, 
0, (VkMergoryBarrier *)nullptr, 

0, (VkBuffterMemoryBarrier *)nullptr, 

1, IN &vimb); 


Il now do the final image transfer: 


VklmageSubresourceLayers visl; 
visl.aspectMask = ҮК IMAGE ASPECT COLOR BIT; 
visl.baseArrayLayer - 0; 
visl.mipLevel = 0; 
visl.layerCount = 1; 


VkOffset3D уо3; 
vo3.x = 0; 
vo3.y = 0; 
vo3.z = 0; 

VkExtent3D ve3; 


ve3.width = texWidth; 
ve3.height = texHeight; 
ve3.depth = 1; 
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үкітадеСору 
vic.srcSubresource = vis 
vic.srcOffset = vo3; 
vic.dstSubresource = visl; 
vic.dstOffset = vo3; 
vic.extent = ve3; 


vkCmdCopylmage(TextureCommandBuffer, 
staginglmage, VK_IMAGE_LAYOUT_TRANSFER_SRC_OPTIMAL, 
texturelmage, VK_IMAGE_LAYOUT_TRANSFER_DST_OPTIMAL, 1, IN &vic); 


ПЫ ыны ыы ыы ыы ыы ыы 
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ПМ ыы ыы kehe dedede kesk Ae Bede ыы ыы 


Il transition the texture buffer layout a second time: 


"Шыныны ыы ыы ыы ыы ынны ыы 


VklmageSubresourceRange 
visr.aspectMask = VK IMAGE 
visr.baseMipLevel = 0; 
visr.levelCount = 1; 
visr.baseArrayLayer = 0; 
visr.layerCount = 1; 


COLOR_BIT; 


VkIlmageMemoryBarrier «» 


vimb.sType = V& ӨТКОСТУКЕТТТРЕ IMAGE MEMORY BARRIER; 
vimb.pNext = nullptr; 
vimb.oldLayout = VK IMA 
vimb.newLayout = VK I 
vimb.srcQueueFamilylydex = 
vimb.dstQueueFamilyfhdex = 
vimb.image = texturgfmage; 
vimb.srcAccessMagk = 0; 
vimb.dstAccessMAsk = VK 
vimb.subresour 


E IJAYOUT TRANSFER DST OPTIMAL; 
АСЕ АҮООТ SHADER READ ONLY OPTIMAL; 
K QUEUE FAMILY. IGNORED; 
K QUEUE FAMILY IGNORED; 


CCESS SHADER READ BIT; 
Range = visr; 


vkCmdPipelineBarfler(TextureCommandBuffer, 
VK PIPELINE STAGE TRANSFER ВІТ, VK PIPELINE STAGE FRAGMENT SHADER ВІТ, 0, 
0, (VkMemfryBarrier *)nullptr, 
0, (VkBuff&rMemoryBarrier *)nullptr, 
1, IN &vimb); 


ыы 


result = vkEndCommandBuffer( TextureCommandBuffer ); 


VkSubmitInfo 

vsi.sType = VK STR 
vsi.pNext = nullptr; 
vsi.commandBufferCount = 1; 
vsi.pCommandBuffers = &TextureCommandBuffer; 
vsi.waitSemaphoreCount = 0; 
vsi.pWaitSemaphores = (УкЗетаДһоге *)nullptr; 
vsi.signalSemaphoreCount = 0; 
vsi.pSignalSemaphores = (VkSemahhore *)nullptr; 
vsi.pWaitDstStageMask = (VkPipelingStageFlags *)nullptr; 


YPE_SUBMIT_INFO; 


result = vkQueueSubmit( Queue, 1, IN &vsi, VK NULL HANDLE ); 
result = vkQueueWaitldle( Queue ); 
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// create an image view for the texture image: 
Il (an “image view” is used to indirectly access an i e) 


VklmageSubresourceRange Суы; ) 
visr.aspectMask = МК IMAGE АБРЕСТ COLOR B 
visr.baseMipLevel = 0; 
visr.levelCount = 1; 
visr.baseArrayLayer = 0; 
visr.layerCount = 1; 


Access to The Actual 


an Image Image Data 


VklmageViewCreatelnfo 
vivci.sType = УК STRUCTURB 
vivci.pNext = nullptr; 
vivci.flags 7 0; 


vivci.image = texturelmage; 8 bits Red 8 bits Green 8 bits Blue 8 bits Alpha 
vivci.viewType = VK IMAGE VIEW TYPE 2D; 


vivci.format = УК FORMAT IiR8G8B8A8 UNORM; 

vivci.components.r = УК COMPONENTASWIZZLE R; 
vivci.components.g = УК COMPONENT YSWIZZLE G; 
vivci.components.b = УК СОМРОМЕМТ $WIZZLE B; 
vivci.components.a = VK COMPONENT SWIZZLE А; 
vivci.subresourceRange - visr; 


PE IMAGE VIEW CREATE INFO; 


result = vkCreatelmageView( LogicalDevice, IN &vivci, PALLOCATOR, OUT &pMyTexture->texImageView); 


return result; 


Note that, at this point, the Staging Buffer is no longer needed, and can be destroyed. 
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result = Init06TextureBufferAndFillFromBmprFile ( “puppy.bmp”, &MyTexturePuppy); 
Init06TextureSampler( &MyPuppyTexture.texSampler ); 


This function can be found in the sample.cpp file. The BMP file needs to be created by something 
that writes uncompressed 24-bit color BMP files, or was converted to the uncompressed BMP format 


Reading in a Texture from a BMP File 


typedef struct MyTexture 

{ 
uint32 t width; 
uint32 t height; 
Vklmage іехітаде; 
VklmageView texImageView; 
VkSampler texSampler; 
VkDeviceMemory vdm; 

} MyTexture; 


MyTexture MyPuppyTexture; 


by a tool such as ImageMagick's convert, Adobe Photoshop, or GNU’s GIMP. 
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Vulkan. 


Queues and Command Buffers 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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Simplified Block Diagram 190 


and Би ег 
ind Buffer 
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Vulkan Queues and Command Buffers 191 
e Graphics commands are recorded in command buffers, e.g., vkCmdDoSomething( cmaBuffer, ... ); 
* You can have as many simultaneous Command Buffers as you want 
* Each command buffer can be filled from a different thread 
* Command Buffers record commands, but no work takes place until a Command Buffer is submitted to a Queue 


Application 


e We don't create Queues - the Logical Device has them already ТТТ 


_ Physical | 
Device 


e Each Queue belongs to a Queue Family 


* We don't create Queue Families — the Physical Device already has them 


— 
Ста buffer| —> 
— 
— 


Cmd buffer 


Cmd buffer 
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Querying what Queue Families are Available 192 


uint32_t count; 
vkGetPhysicalDeviceQueueFamilyProperties( IN PhysicalDevice, &count, OUT (VkQueueFamilyProperties *) nullptr ); 


VkQueueFamilyProperties *vqfp = new VkQueueFamilyProperties[ count ]; 
vkGetPhysicalDeviceFamilyProperties( PhysicalDevice, &count, OUT &vafp, ); 


for( unsigned int i = 0; i < count; i++ ) 
{ 
рип FpDebug, "\t%d: Queue Family Count = %2d ; ", i, vgfp[i].queueCount ); 
ІК ( vqfp[i].queueFlags & VK_QUEUE_GRAPHICS ВІТ ) != 0 ) fprintf( FpDebug, " Graphics" ); 
ІК ( vqfp[i].queueFlags «УК QUEUE COMPUTE BIT )!=0) fprintf( FoDebug, " Compute " ); 
ІК ( vgfp[i].queueFlags & УК QUEUE TRANSFER BIT )!= 0 ) fprintf( FoDebug, " Transfer" ); 
fprintf(FpDebug, n"); 


Found 3 Queue Families: 
0: Queue Family Count = 16 ; Graphics Compute Transfer 
1: Queue Family Count = 1; Transfer 
2: Queue Family Count - 8; Compute 


H ru 
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Similarly, we Can Write a Function that Finds the Proper Queue Family 193 


int 
FindQueueFamilyThatDoesGraphics( ) 
{ 
uint32 t count = -1; 
vkGetPhysicalDeviceQueueFamilyProperties( IN PhysicalDevice, OUT &count, OUT (VkQueueFamilyProperties *)nullptr ); 
VkQueueFamilyProperties new VkQueueFamilyProperties[ count ]; 
vkGetPhysicalDeviceQueugFamilyProperties( IN PhysicalDevice, IN &count, OUT ма ); 
for( unsigned int | ЖО; i < count; i++ ) 
ІК ( vgfp[ i ].queueFlags & VK QUEUE GRAPHICS BIT ) != 0 ) 
return i; 
} 
return -1; 
} 
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Creating a Logical Device Needs to Know Queue Family Information 194 


Application 


vdqci[0].queueFamilyInd& 
vdqci[0].queueCount = 1; 


vdci.sType = VK \ 
vdci.pNext = nullptr; 
vdci.flags = 0; 
vdci.queueCreatelnfoCo Il # of device queues wanted 
vdci.pQueueCreatelnfos = IN, &vdqci[0]; II array of VkDeviceQueueCreatelnfo's 
vdci.enabledLayerCount = siz@gf(myDeviceLayers) / sizeof(char *); 
vdci.ppEnabledLayerNames = myQeviceLayers; 

vdci.enabledExtensionCount = sizeòf(myDeviceExtensions) / sizeof(char *); 
vdci.ppEnabledExtensionNames = myNeviceExtensions; 

vdci.pEnabledFeatures = IN &PhysicalDewjceFeatures; // already created 


result = vkCreateLogicalDevice( PhysicalDevice, IN &vdci, PALLOCATOR, OUT &LogicalDevice ); 
VkQueue Queue; 


uint32 t queueFamilyIndex = FindQueueFamilyThatDoesGraphics( ); 
uint32_t queuelndex = 0; 


result = vkGetDeviceQueue ( LogicalDevice, queueFamilyIndex, queuelndex, OUT &Queue ); 
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Creating the Command Pool as part of the Logical Device 195 


VkResult 
Init06CommandPool[( ) 


VkResult result; 


VkCommandPoolCreatelnfo 
vcpci.sType = УК STRUCTURE ТҮР 
vcpci.pNext - nullptr; 
vcpci.flags = VK COMMAND POOL _ 

| VK COMMAND POOL CR 


COMMAND POOL CREATE INFO; 


REATE RESET COMMAND BUFFER BIT 

RATE TRANSIENT ВІТ; 

#ifdef CHOICES 

VK COMMAND POOL CREATE TRANSIENT BIT 

VK COMMAND POOL CREATE RESET COMMAND \BUFFER_BIT 

#endif 
vcpci.queueFamilyIndex = FindQueueFamilyTha\DoesGraphics( ); 


result = vkCreateCommandPool( LogicalDevice, IN &\срсі, PALLOCATOR, OUT &CommandPool ); 


return result; 
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Creating the Command Buffers 196 


VkResult 
Init06CommandBuffers( ) 


VkResult result; 


Il allocate 2 command buffers for the double-buffered rendering: 


Application 


VkCommandBufferAllocatelnfo 
vcbai.sType = VK STRUCTURE TYP 
vcbai.pNext = nullptr; 
vcbai.commandPool = CommandPool; 
vcbai.level = VK COMMAND BUFFER LEVEL PRIMARY; 
vcbai.commandBufferCount = 2; ІІ 2, because òf double-buffering 


OMMAND BUFFER ALLOCATE INFO; 


Physical 
Device 


result = vkAllocateCommandBuffers( LogicalDevice, IN &vcbai, OUT &CommandBuffers[0] ); 


Command Buffer 


allocate 1 command buffer for the transferring pixels from a staging buffer to a texture buffer: 


VkCommandBufferAllocatelnfo Cvebai; > 


vcbai.sType = VK_STRUCTURE_TYP OMMA 
vcbai.pNext = nullptr; 
vcbai.commandPool = CommandPool; 
vcbai.level = МК COMMAND BUFFER LEVEL P 
vcbai.commandBufferCount = 1; 


ND BUFFER ALLOCATE INFO; 
MARY; 
result = vkAllocateCommandBuffers( LogicalDevice, IN &vcbai, OUT &TextureCommandBuffer ); 


} 


return result; 
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Beginning a Command Buffer — One per Image 


VkSemaphoreCreatelnfo 
vsci.sType = УК STRUCTURE ТҮРЕ” 
vsci.pNext - nullptr; 
vsci.flags = 0; 


DBEMAPHORE CREATE INFO; 


VkSemaphore imageReadySemaphore; 
result = vkCreateSemaphore( LogicalDevice, IN &vsci, PALLOCATOR, OUT&IimageReadySemaphor® 


uint32 t nextlmagelndex; 
vkAcquireNextlmageKHR( LogicalDevice, IN SwapChai 


VkCommandBufferBeginInfo 
vcbbi.sType = VK_STRUCTUR 
vcbbi.pNext = nullptr; 
vcbbi.flags = V&K COMMAND BUFFER USAGE ОМ 
vcbbi.pInheritancelnfo = (VKCommandBufferlnheritangefnfo 


. SUBMIT BIT; 
qullptr; 


result = vkBeginCommandBuffer( CommandBuffers[nextlImagelndex], IN &vcbbi ); 


vkEndCommandBuffer( CommandBuffers[nextlmagelndex] ); 
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Beginning a Command Buffer 198 


VkCommandBufferPoolCreatelnfo 


vkCreateCommandBufferPool( ) 


VkCommandBufferAllocatelnfo 


VkCommandBufferBeginInfo 


vkAllocateCommandBuffer( ) 


vkBeginCommandBuffer( ) 
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These are the Commands that could be entered into the Command Buffer, | 199 


vkCmdBeginQuery( commandBuffer, flags ); 

vkCmdBeginRenderPass( commandBuffer, const contents ); 
vkCmdBindDescriptorSets( commandBuffer, pDynamicOffsets ); 
vkCmdBindIndexBuffer( commandBuffer, indexType ); 

vkCmdBindPipeline( commandBuffer, pipeline ); 

vkCmdBindVertexBuffers( commandBuffer, firstBinding, bindingCount, const pOffsets ); 
vkCmaBlitlmage( commandBuffer, filter ); 

vkCmdClearAttachments( commandBuffer, attachmentCount, const pRects ); 
vkCmdClearColorlmage( commandBuffer, pRanges ); 

vkCmdClearDepthStencillmage( commandBuffer, pRanges ); 

vkCmdCopyBuffer( commandBuffer, pRegions ); 

vkCmdCopyBufferTolmage( commandBuffer, pRegions ); 

vkCmdCopylmage( commandBuffer, pRegions ); 

vkCmdCopylmageToBuffer( commandBuffer, pRegions ); 
vkCmdCopyQueryPoolResults( commandBuffer, flags ); 
vkCmdDebugMarkerBeginEXT( commandBuffer, pMarkerlnfo ); 
vkCmdDebugMarkerEndEXT ( commandBuffer ); 

vkCmdDebugMarkerlnsertEXT( commandBuffer, pMarkerlnfo ); 

vvkCmdDispatch( commandBuffer, groupCountX, groupCountY, groupCountZ ); 
vkCmabDispatchlndirect( commandBuffer, offset ); 

vkCmdDraw( commandBuffer, vertexCount, instanceCount, firstVertex, firstInstance ); 
vkCmdDrawlndexed( commandBuffer, indexCount, instanceCount, firstIndex, int32 t vertexOffset, firstInstance ); 
vkCmdDrawlndexedlndirect( commandBuffer, stride ); 
vkCmdDrawlndexedIndirectCountAMD( commandBuffer, stride ); 

vkCmdDrawlndirect( commandBuffer, stride ); 

vkCmdDrawindirectCountAMD( commandBuffer, stride ); 

vkCmdEndQuery( commandBuffer, query ); 

vkCmdEndRenderPass( commandBuffer ); 

vkCmdExecuteCommands( commandBuffer, commandBufferCount, const pCommandBuffers ); 
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These are the Commands that could be entered into the Command Buffer, Il 200 


vkCmdFillBuffer( commandBuffer, dstBuffer, dstOffset, size, data ); 

vkCmdNextSubpass( commandBuffer, contents ); 

vkCmdPipelineBarrier( commandBuffer, srcStageMask, dstStageMask, dependencyFlags, memoryBarrierCount, VkMemoryBarrier 

pMemoryBarriers, bufferMemoryBarrierCount, pBufferMemoryBarriers, imageMemoryBarrierCount, plmageMemoryBarriers ); 

vkCmdProcessCommandsNVX( commandBuffer, pProcessCommandslnfo ); 

vkCmdPushConstants( commandBuffer, layout, stageFlags, offset, size, pValues ); 

vkCmdPushDescriptorSetKHR( commandBuffer, pipelineBindPoint, layout, set, descriptorWriteCount, pDescriptorWrites ); 

vkCmdPushDescriptorSetWithTemplateKHR( commandBuffer, descriptorUpdateTemplate, layout, set, pData ); 

vkCmdReserveSpaceForCommandsNVX( commandBuffer, pReserveSpacelnfo ); 

vkCmdResetEvent( commandBuffer, event, stageMask ); 

vkCmdResetQueryPool( commandBuffer, queryPool, firstQuery, queryCount ); 

vkCmdResolvelmage( commandBuffer, srclmage, srclmageLayout, dstlmage, dstImageLayout, regionCount, pRegions ); 

vkCmdSetBlendConstants( commandBuffer, blendConstants[4] ); 

vkCmdSetDepthBias( commandBuffer, depthBiasConstantFactor, depthBiasClamp, depthBiasSlopeFactor ); 

vkCmdSetDepthBounds( commandBuffer, minDepthBounds, maxDepthBounds ); 

vkCmdSetDeviceMaskKHX( commandBuffer, deviceMask ); 

vkCmdSetDiscardRectangleEXT( commandBuffer, firstDiscardRectangle, discardRectangleCount, pDiscardRectangles ); 

vkCmdSetEvent( commandBuffer, event, stageMask ); 

vkCmdSetLineWidth( commandBuffer, lineWidth ); 

vkCmdSetScissor( commandBuffer, firstScissor, scissorCount, pScissors ); 

vkCmdSetStencilCompareMask( commandBuffer, faceMask, compareMask ); 

vkCmdSetStencilReference( commandBuffer, faceMask, reference ); 

vkCmdSetStencilWriteMask( commandBuffer, faceMask, writeMask ); 

vkCmdSetViewport( commandBuffer, firstViewport, viewportCount, pViewports ); 

vkCmdSetViewportWScalingNV( commandBuffer, firstViewport, viewportCount, pViewportWScalings ); 

vkCmdUpdateBuffer( commandBuffer, dstBuffer, dstOffset, dataSize, pData ); 

vkCmdWaitEvents( commandBuffer, eventCount, pEvents, srcStageMask, dstStageMask, memoryBarrierCount, pMemoryBarriers, 
bufferMemoryBarrierCount, pBufferMemoryBarriers, imageMemoryBarrierCount, plmageMemoryBarriers ); 

vkCmdWriteTimestamp( commandBuffer, pipelineStage, queryPool, query ); 


ж 
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VkResult 
RenderScene( ) 
{ 
VkResult result; 
VkSemaphoreCreatelnfo 
vsci.sType = VK STRUCTURE TYP EMAPHORE CREATE INFO; 
vsci.pNext = nullptr; 
vsci.flags = 0; 


VkSemaphore imageReadySemaphore; 
result = vkCreateSemaphore( LogicalDevice, IN &vsci, PALLOCATOR, OUT &imageReadySemaphore ); 


uint32 t nextlmagelndex; 
vkAcquireNextlmageKHR( LogicalDevice, ІМ SwapChain, IN _ MAX, IN VK NULL HANDLE, 
IN VK NULL HANDLE, ОЯТ &nextlmagelndex ); 


VkCommandBufferBeginInfo Cvebbi; > 
vebbi.sType = VK STRUCTURE T? 
vcbbi.pNext = nullptr; 


vcbbi.plnheritancelnfo = (VkCommandBufferlnheritancgInfo *)ntntr; 


result = vkBeginCommandBuffer( CommandBuffers[nextlmagelndex], IN &vcbbi ); 
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VkClearColorValue 
vccv.float32[0] = 0.0; 
vccv.float32[1] = 0.0; 
vccv.float32[2] = 0.0; 
vccv.float32[3] = 1.0; 


VkClearDepthStencilValue 
vcdsv.depth = 1.f; 
vcdsv.stencil = 0; 


VkClearValue 
vcv[0].color = vccv; 
vcv[1].depthStencil = vcdsv; 


VkOffset2D o2d = { 0, 0}; 
VkExtent2D.e2d = { Width, Height }; 
VkRect2I{ r2d 2 02d, e2d }; 


VkRenderPassk 


едіпіпіо 


vrpbi.renderArea = r2d; 
vrpbi.clearValueCount 
vrpbi.pClearValues = үсу; Il used for УК ATTACHMENT LOAD OP CLEAR 


vkCmdBeginRenderPass( CommandBuffers[nextlmagelndex], IN &vrpbi, IN VK SUBPASS CONTENTS INLINE ); 


m 
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0., Il y 

(float)Width, 

(float)Height, 

0., // minDepth 
Il maxDepth 


vkCmdSetViewport( CommandBuffers[nextlmagelndex], 0, 1, IN &viewport ); Il O-firstViewport, 1=viewportCount 


vkCmdSetScissor( CommandBuffers[nextlmagelndex], 0, 1, IN &scissor ); 
vkCmdBindDescriptorSets( CommandBuffers[nextlmagelndex], V& PIPELINE BIND POINT GRAPHICS, 
GraphicsPipelineLayout, 0, 4, DescriptorSets, 0, (uint32 t *)nullptr ); 
Il dynamic offset count, dynamic offsets 
vkCmdBindPushConstants( CommandBuffers[nextlmagelndex], PipelineLayout, УК SHADER STAGE ALL, offset, size, void *values ); 
VkBuffer buffers[1] = ( MyVertexDataBuffer.buffer }; 


VkDeviceSize offsets[1] = í 0 }; 


vkCmdBindVertexBuffers( CommandBuffers[nextlmagelndex], 0, 1, buffers, offsets ); Il 0, 1 = firstBinding, bindingCount 


const uint32 t vertexCount = sizeof(VertexData) / sizeof(VertexData[0]); 

const uint32 t instanceCount 7 1; 

const uint32_t firstVertex = 0; 

const uint32_t firstInstance = 0; 

vkCmdDraw( CommandBuffers[nextlmagelndex], vertexCount, instanceCount, firstVertex, firstInstance ); 


vkCmdEndRenderPass( CommandBuffers[nextlmagelndex] ); 


vkEndCommandBuffer( CommandBuffers[nextlmagelndex] ); 
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Submitting a Command Buffer to a Queue for Execution 204 


VkSubmitInfo vsi; 
vsi.sType = УК STRUCTURE TYPE SUBMIT INFO; 
vsi.pNext = nullptr; 
vsi.commandBufferCount = 1; 
vsi.pCommandBuffers = &CommandBuffer; 
vsi.waitSemaphoreCount = 1; 
vsi.pWaitSemaphores = imageReadySemaphore; 
vsi.signalSemaphoreCount = 0; 
vsi.pSignalSemaphores = (VkSemaphore *)nullptr; 
vsi.pWaitDstStageMask = (VkPipelineStageFlags *)nullptr; 
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The Entire Submission / Wait / Display Process 205 


vici.sType = VK STRUCTO FENCE_CREATE_INFO; 
vfci.pNext = nullptr; 
vfci.flags = 0; 


VkFence renderFence; 
vkCreateFence( LogicalDevice, IN &vfci, PALLOCATOR, OUQ 
result = VK SUCCESS; 


render ence} 


VkPipelineStageFlags waitAtBottom = VK_PIPELINE_ STAGE _BOTT'OM_OF_PIPE_BIT; 
VkQueue presentQueue; 
vkGetDeviceQueue( LogicalDevice, FindQueueFamilyThatDoesGyaphics( ), 0, OUT &presentQueue );\ 

I 0 =, queuelndex 


VkSubmitInfo 
vsi.sType = VK STRUCTURR 
vsi.pNext = nullptr; 
vsi.waitSemaphoreCount = 1; 
vsi.pWaitSemaphores = &imageR&adySemaphore; 
vsi.pWaitDstStageMask = &waitAtBattom; 
vsi.commandBufferCount = 1; 
vsi.pCommandBuffers = &CommandBukers[nextlmagelydex]; 
vsi.signalSemaphoreCount = 0; 
vsi.pSignalSemaphores = &SemaphoreRerNerFinished 


TYPE_SUBMIT_INFO; 


result = vkQueueSubmit( presentQueue, 1, IN &vsi, IN renderFence ); // 1 = submitCount 
result = vkWaitForFences( LogicalDevice, 1, IN &renderFence, VK TRUE, UINT64 MAX); // waitAll, timeout 


vkDestroyFence( LogicalDevice, renderFence, PALLOCATOR ); 


VkPresentInfoKHR С vei) 


vpi.sType = УК STRUCTURE TlYRE PRESENT INFO КНК; 
vpi.pNext = nullptr; 
vpi.waitSemaphoreCount = 0; 
vpi.pWaitSemaphores = (VkSemaphorà *)nullptr; 
vpi.swapchainCount = 1; 
vpi.pSwapchains = &SwapChain; 
vpi.plmagelndices = &nextlmagelndex; 
vpi.pResults = (VkResult *)nullptr; 


x THINK result - vkQueuePresentKHR( presentQueue, IN &vpi ); 
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What Happens After a Queue has Been Submitted? 206 


As the Vulkan 1.1 Specification says: 


“Command buffer submissions to a single queue respect submission order and other 
implicit ordering guarantees, but otherwise may overlap or execute out of order. Other 
types of batches and queue submissions against a single queue (e.g. sparse memory 
binding) have no implicit ordering constraints with any other queue submission or 
batch. Additional explicit ordering constraints between queue submissions and 
individual batches can be expressed with semaphores and fences.” 


In other words, the Vulkan driver on your system will execute the commands in a single buffer in 
the order in which they were put there. 


But, between different command buffers submitted to different queues, the driver is allowed to 
execute commands between buffers in-order or out-of-order or overlapped-order, depending on 
what it thinks it can get away with. 


The message here is, | think, always consider using some sort of Vulkan synchronization when 
one command depends on a previous command reaching a certain state first. 
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Vulkan. 


The Swap Chain 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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208 
How OpenGL Thinks of Framebuffers 


кнн i | Refresh 
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How Vulkan Thinks of Framebuffers — the Swap Chain 


Present 


ғ 
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What is a Swap Chain? 210 
Vulkan does not use the idea of a “back buffer’. So, we need a place to render into 
before moving an image into place for viewing. The is called the Swap Chain. 
In essence, the Swap Chain manages one or more image objects that form a sequence 
of images that can be drawn into and then given to the Surface to be presented to the 
user for viewing. j 


Swap Chains are arranged as a ring buffer ------ = 


Swap Chains are tightly coupled to the window system. 


After creating the Swap Chain in the first place, the process for using the Swap Chain is: 


. Ask the Swap Chain for an image 

. Render into it via the Command Buffer and a Queue 

. Return the image to the Swap Chain for presentation 

. Present the image to the viewer (copy to “front buffer’) 


THINK Е. AER. 
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We Need to Find Out What our Display Capabilities Are 211 


VkSurfaceCapabilitiesK HR 
1.7 PhysicalDevice, Surface, OUT &vsc ) 


VkExtent2D surfaceRes = vsc.currentExtent; 
fprintf( FpDebug, "\nvkGetPhysicalDeviceSurfaceCapabilitiesKHR:\n" ); 


VkBool32 supported; 
result = vkGetPhysicalDeviceSurfaceSupportKHR( PhysicalDevice, FindQueueFamilyThatDoesGraphics( ), Surface, &supported ); 
if( supported == VK_TRUE ) 

fprintf( FpDebug, "** This Surface is supported by the Graphics Queue **\n" ); 


uint32_t formatCount; 

vkGetPhysicalDeviceSurfaceFormatsKHR( PhysicalDevice, Surface, &formatCount, (VkSurfaceFormatKHR *) nullptr ); 
VkSurfaceFormatKHR * surfaceFormats = new VkSurfaceFormatKHR[ formatCount |; 
vkGetPhysicalDeviceSurfaceFormatsKHR( PhysicalDevice, Surface, &formatCount, surfaceFormats ); 

fprintf( FpDebug, "\nFound %d Surface Formats:\n", formatCount ) 


uint32_t presentModeCount; 

vkGetPhysicalDeviceSurfacePresentModesKHR( PhysicalDevice, Surface, &presentModeCount, (VkPresentModeKHR *) nullptr ); 
VkPresentModeKHR * presentModes = new VkPresentModeKHR[ presentModeCount |; 
vkGetPhysicalDeviceSurfacePresentModesKHR( PhysicalDevice, Surface, &presentModeCount, presentModes ); 

fprintf( FpDebug, ^nFound %d Present Modes:\n", presentModeCount ); 
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We Need to Find Out What our Display Capabilities Are 212 


VulkanDebug.txt output: 


vkGetPhysicalDeviceSurfaceCapabilitiesKHR: 
minlmageCount = 2 ; maxlmageCount = 8 
currentExtent = 1024 x 1024 
minlmageExtent = 1024 x 1024 
maxlmageExtent = 1024 x 1024 
maxlmageArrayLayers = 1 
supportedTransforms = 0x0001 
currentTransform = 0x0001 
supportedCompositeAlpha = 0x0001 
supportedUsageFlags = 0x009f 


** This Surface is supported by the Graphics Queue ** 


Found 2 Surface Formats: 
0: 44 0 (VK FORMAT B8G8R8A8 UNORM, VK COLOR SPACE SRGB NONLINEAR KHR ) 
1: 50 0 (VK FORMAT B8G8R8A8 SRGB, VK COLOR SPACE SRGB NONLINEAR KHR ) 


Found 3 Present Modes: 


0 2 ( VK PRESENT. MODE FIFO KHR ) 
11 3 (УК PRESENT. MODE FIFO RELAXED КНК) 
2: 1 ( VK PRESENT. MODE MAILBOX KHR ) 


m 
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Creating a Swap Chain 213 


vkGetDevicePhysicalSurfaceCapabilities( ) 


VkSurfaceCapabilities 


WA 


surface minlmageCount 

imageFormat maxlmageCount 
imageColorSpace currentExtent 

imageExtent minlmageExtent 
imageArrayLayers maxlmageExtent 

imageUsage maxlmageArrayLayers 
imageSharingMode supportedTransforms 

preTransform currentTransform 

compositeAlpha supportedCompositeAlpha 
presentMode 
clipped 


VkSwapchainCreatelnfo 


vkCreateSwapchain( ) 


vkGetSwapChainlmages( ) 


vkCreatelmageView( ) 
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Creating a Swap Chain 214 


VkSurfaceCapabilitiesKHR 
vkGetPhysicalDeviceSurfaceCapabilitiesKHR( PhysicalDevice, Surface, OUT &vsc ); 


VkExtent2D surfaceRes = vsc.currentExtent; 


VkSwapchainCreatelnfoKHR 
vscci.sType = VK_STRUCTURE\ TYPE SWAPCHAIN CREATE INFO KHR; 
vscci.pNext = nullptr; 
vscci.flags = 0; 
vscci.surface = Surface; 
vscci.minlmageCount = 2; 
vscci.imageFormat = УК FORMAT 
vscci.imageColorSpace = УК COLORSPACE SRGB NONLINEAR KHR; 
vscci.imageExtent.width = surfaceReS,width; 
vscci.imageExtent.height = surfaceReg, height; 
vscci.imageUsage = VK IMAGE USAGE COLOR ATTACHMENT BIT; 
vscci.preTransform - V&K SURFACE TRANSFORM IDENTITY BIT KHR; 
vscci.compositeAlpha = УК COMPOSITE ALPHA OPAQUE ВІТ КНК; 
vscci.imageArrayLayers = 1; 
vscci.imageSharingMode = VK_SHARING\ MODE_EXCLUSIVE; 
vscci.queueFamilyIndexCount = 0; 
vscci.pQueueFamilyIndices = (const uint32 ү *)nullptr; 
vscci.presentMode = VK PRESENT MODE| MAILBOX KHR; 
vscci.oldSwapchain = ҮК NULL HANDLE; 
vscci.clipped = VK TRUE; 


II double buffering 
8G8R8A8_UNORM; 


result = vkCreateSwapchainKHR( LogicalDevice, IN &vscci, PALLOCATOR, OUT &SwapChain ); 
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Creating the Swap Chain Images and Image Views 


uint32_t imageCount; Il # of display buffers — 2? 3? 
result = vkGetSwapchainlmagesKHR( LogicalDevice, IN SwapChain, OUT &imageCount, (VkImage *)nullptr ); 


Presentlmages = new Vklmage[ imageCount ]; 
result = vkGetSwapchainlmagesKHR( LogicalDevice, SwapChain, OUT &imageCount, Presentlmages ); 


// present views for the double-buffering: 


PresentlmageViews = new VkImageView[ imageCount |; 


for( unsigned int i = 0; i < imageCount; i++ ) 


{ 


VkImageViewCreatelnfo 
vivci.sType = VK_STRU 
vivci.pNext = nullptr; 
vivci.flags = 0; 


«Uc» 


ОКЕ TYPE IMAGE VIEW. CREATE INFO; 


vivci.components.g = УК COMPONENT SWIZZLE С; 


vivci.subresourceRange.aspectMask - 
vivci.subresourceRange.baseMipLevel 
vivci.subresourceRange.levelCount = 1; 
vivci.subresourceRange.baseArrayLayer 
vivci.subresourceRange.layerCount = 1; 
vivci.image = Presentlmages[ 1]; 


result = vkCreatelmageView( LogicalDevice, IN &vivci, PALLOCATOR, OUT &PresentlmageViews[ i |); 
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Rendering into the Swap Chain, | 216 


VkSemaphoreCreatelnfo «О» 
vsci.sType = VK_STRUCTUR YRE SEMAPHORE CREATE INFO; 
vsci.pNext = nullptr; 
vsci.flags = 0; 


VkSemaphore imageReadySemaphore; 
result = vkCreateSemaphore( LogicalDevice, IN &vsci, PALLOCATOR, OUT &imageReadySemaphore ); 


uint32 t nextlmagelndex; 
иіпіб4 t tmeout = UINT64 MAX; 


vkAcquireNextlmageKHR( LogicalDevice, IN SwapChain, IN timeout, IN imageReadySemaphore, 
IN VK NULL HANDLE, OUT &nextlmagelndex ); 


result = vkBeginCommandBuffer( CommandBuffers[ nextlmagelndex |, IN &vcbbi ); 


vkCmdBeginRenderPass( CommandBuffers[nextlmagelndex], IN &vrpbi, 
IN VK SUBPASS CONTENTS INLINE ); 


vkCmaBindPipeline( CommandBuffers[nextlmagelndex], ҮК PIPELINE BIND POINT GRAPHICS, GraphicsPipeline ); 


vkCmdEndRenderPass( CommandBuffers[ nextlmagelndex ] ); 
vkEndCommandBuffer( CommandBuffers[ nextlmagelndex |); 
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Rendering into the Swap Chain, Il 217 


VkFenceCreatelnfo 
vfci.sType = VK_STRUCTURE_TYPE_FENCE_CREATE_INFO; 
vfci.pNext = nullptr; 
vfci.flags = 0; 


VkFence renderFence; 
vkCreateFence( LogicalDevice, &vfci, PALLOCATOR, OUT &renderFence ); 


VkQueue presentQueue; 
vkGetDeviceQueue( LogicalDevice, FindQueueFamilyThatDoesGraphics( ), 0, 
OUT &presentQueue ); 


VkSubmitlnfo 
vsi.sType = VK STRU 
vsi.pNext = nullptr; 
vsi.waitSemaphoreCount = \{; 
vsi.pWaitSemaphores = &imageReadySemaphore; 
vsi.pWaitDstStageMask = &waitAtBottom; 
vsi.commandBufferCount = 1; 
vsi.pCommandBuffers = «Сотта 
vsi.signalSemaphoreCount = 0; 
vsi.pSignalSemaphores = &SemaphdreRenderFinished; 


ÜRE TYPE SUBMIT. INFO; 


dBuffers[ nextlmagelndex |; 


result = vkQueueSubmit( presentQueue, 1, IN &vsi, ІМ renderFence ); //1 = submitCount 
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Rendering into the Swap Chain, Ill 218 


result = vkWaitForFences( LogicalDevice, 1, ІМ &renderFence, VK TRUE, UINT64 MAX ); 


VkPresentInfoKHR 
vpi.sType = VK_STRUCTURE_YYPE_PRESENT_INFO_KHR; 
vpi.pNext = nullptr; 
vpi.waitSemaphoreCount = 0; 
vpi.pWaitSemaphores = (VkSemaphore *)nullptr; 
vpi.swapchainCount = 1; 
vpi.pSwapchains = &SwapChain; 
vpi.plmagelndices = &nextlmagelnde 
vpi.pResults = (VkResult *) nullptr; 


result = vkQueuePresentKHR( presentQueue, IN &vpi ); 


H ru 
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Vulkan. 


Push Constants 


Mike Bailey 


mjb@cs.oregonstate.edu 


http://cs.oregonstate.edu/~mjb/vulkan 
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Push Constants 220 


In an effort to expand flexibility and retain efficiency, Vulkan provides something called 
Push Constants. Like the name implies, these let you “push” constant values out to the 
shaders. These are typically used for small, frequently-updated data values. This is good, 
since Vulkan, at times, makes it cumbersome to send changes to the graphics. 


By “small”, Vulkan specifies that these must be at least 128 bytes in size, although they 
can be larger. For example, the maximum size is 256 bytes on the NVIDIA 1080ti. (You 
can query this limit by looking at the maxPushConstantSize parameter in the 
VkPhysicalDeviceLimits structure.) Unlike uniform buffers and vertex buffers, these are 
not backed by memory. They are actually part of the Vulkan pipeline. 


THINK 
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Descriptor Set 


ae Creating a Pipeline 221 


which stage (VERTEX, etc.) 


bindi 
VkPipelineLayoutCreatelnfo ui 


stride nati 
Н location 
VkShaderModule inputRate binding 
format 
offset 


VkSpecializationInfo 


vkCreatePipelineLayout( ) VkVertexInputBindingDescription 


== VkVertexInputAttributeDescription 
VkPipelineShaderStageCreatelnfo 


VkPipelineVertexInputStateCreatelnfo Topology 
Shaders VkPipelinelnputAssemblyStateCreatelnfo 
VertexInput State L ae, x, y, w, h 
InputAssembly State minDepth, 


Tesselation State VkViewportStateCreatelnfo М 
Viewport State il] p 
i Т extent 
VkPipelineDepthStencilStateCreatelnfo cullMode 


Rasterization State 
MultiSample State 
polygonMode 
frontFace 


VkPipelineRasterizationStateCreatelnfo 
DepthStencil State 
ColorBlend State 
Dynamic State 
Pipeline layout 
RenderPass 
basePipelineHandle 
basePipelinelndex 


lineWidth 


depthTestEnable 
depthWriteEnable 
depthCompareOp 


VkPipelineColorBlendStateCreatelnfo 


VkGraphicsPipelineCreatelnfò stencilTestEnable 
stencilOpStateFront 
stencilOpStateBack 


VkPipelineColorBlendAttachmentState 


blendEnable 
А Y. ls srcColorBlendFactor 
vkCreateGraphicsPipeline( ) dstColorBlendFactor 
colorBlendOp 
srcAlphaBlendFactor 


е VkPipelineDynamicStateCreatelnfo dstAlphaBlendFactor 

raphics Pipelin alphaBlendOp 

е 2 SIGGRAPH THINK Graphics Pipeline |) - - colorWriteMask 
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Push Constants 222 


On the shader side, if, for example, you are sending a 4x4 matrix, the use of push 
constants in the shader looks like this: 


layout( push_constant ) uniform matrix 


mat4 modelMatrix; 
} Matrix; 


On the application side, push constants are pushed at the shaders by 
binding them to the Vulkan Command Buffer: 


vkCmdPushConstants( CommandBuffer, PipelineLayout, stageFlags, 
offset, size, pValues ); 


where: 
stageFlags are or'ed bits of VK PIPELINE STAGE VERTEX SHADER BIT, 
VK PIPELINE STAGE FRAGMENT SHADER BIT, etc. 


size is in bytes 


pValues is a void * pointer to the data, which, in this 4x4 matrix example, would be of type glm::mat4. 
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Setting up the Push Constants for the Pipeline Structure 


Prior to that, however, the pipeline layout needs to be told about the Push Constants: 


VkPushConstantRange Cyper[1]) 


vpcr[0].stageFlags = 
VK PIPELINE STAGE VERTEX SHADER BIT 
|УК PIPELINE STAGE FRAGMENT SHADER ВІТ; 
vpcr[0].offset = 0; 
vpcr[0].size = sizeof( glm::mat4 ); 


VkPipelineLayoutCreatelnfo 
vplci.s Type = VK STRUCTURE ТҮРЕ PIP E LAYOUT CREATE INFO; 
vplci.pNext = nullptr; 
vplci.flags = 0; 
vplci.setLayoutCount = 4; 
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An Robotic Example using Push Constants 224 


A robotic animation (i.e., a hierarchical transformation system) 


Where each arm is represented by: 


struct arm 


{ 


glm::mat4 armMatrix; 
glm::vec3 armColor; 
float armScale;  //scale factor in x 


È 


struct arm — Arm1; 
struct arm Агт2; 
struct arm — Arm3; 
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Forward Kinematics: 225 
You Start with Separate Pieces, all Defined in their Own Local Coordinate System 


» THINK 
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Forward Kinematics: 226 
Hook the Pieces Together, Change Parameters, and Things Move 
(All Young Children Understand This) 
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Forward Kinematics: 227 
Given the Lengths and Angles, Where do the Pieces Move To? 


Locations? 


Ground 


© SIGGRAPH IW. | 
- тір — July 24, 2020 


228 
Positioning Part #1 With Respect to Ground 


1. Rotate by O1 
2. Translate by Т, 


Write it 


М.с | - kn | l [RA | 


Say it 
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229 
Positioning Part #2 With Respect to Ground 


1. Rotate by O2 

2. Translate the length of part 1 

3. Rotate byO 1 

4. Translate by Т,» Write it 


"lll: ыы m 
“о TR 
. A 


[Ms = [Ти ]* МЕЛ 
om x TC 


(М.с = M, ]*[M,,] 


Say it 
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230 
Positioning Part #3 With Respect to Ground 


1. Rotate ру ОЗ 

2. Translate the length of part 2 

3. Rotate by O2 

4. Translate the length of part 1 

5. Rotate by O1 

6. Translate by Ту; — 


и... а 
" NO TS 
"SEMEN L о 7% 
ж an aN 
d 4, 


Ж 
D 


a 
a . 
NE ңа | 5а... 2 
il cC". wr GEM M nn ` : а” 
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In the Reset Function 231 


struct arm Arm1; 
struct arm Arm2; 
struct arm Arm3; 


Arm1.armMatrix = glm::mat4( 1. ); 
Arm1.armColor = glm::vec3( 0.f, 1.f, 
Arm1.armScale = 6.f; 


The constructor glm::mat4( 1. ) 
produces an identity matrix. The 


Arm2.armMatrix = glm::mat4( 1. ); 
Arm2.armColor = glm::vec3( 1.f, 0.f, 0.f ); 
Arm2.armScale = 4.f; 


actual transformation matrices will 
be set in UpdateScene( ). 


Arm3.armMatrix = glm::mat4( 1. ); 
Arm3.armColor = glm::vec3( 0.f, O.f, 1.f ); 
Arm3.armScale = 2.f; 
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Setup the Push Constant for the Pipeline Structure 232 


VkPushConstantRange 
vpcr[0].stageFlags = 
VK_PIPELINE_STAGE|VERTEX_SHADER_BIT 
| VK PIPELINE 5ТАСЕ|ҒКАСМЕМТ SHADER ВІТ; 


vpcr[O].offset = 0; 
vpcr[0].size = sizeof( struct arm ) 

€ pi : IPELINE LAYOUT CREATE INFO; 
vplci.pSetLayouts = DescriptorSetLayouts; 
vplci.pushConstantRangeCount # 1; 
vplci.pPushConstantRanges = vpcr; 


VkPipelineLayoutCreatelnfo 
vplci.s Type = УК STRUCTURE _ 
vplci.pNext = nullptr; 
vplci.flags = 0; 
vplci.setLayoutCount - 4; 


result = vkCreatePipelineLayout( LogicalDevice, IN &vplci, PALLOCATOR, 
OUT &GraphicsPipelineLayout ); 


H ru 
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In the UpdateScene Functiom 233 


float rot1 = (float)Time; 
float rot2 = 2.1 * rot1; 
float rot3 = 2.f * rot2; 


glm::vec3 zaxis = glm::vec3(0., 0., 1.); 
glm::mat4 m1g = glm::mat4( 1. ); // identity 
mig = glm::translate(m1g, glm::vec3(0., 0., 0.)); 


mig = glm::rotate(m1g, го, zaxis); 


glm::mat4 m21 = glm::mat4( 1. ); // identity 


Il ПТК] 


m21 = glm::translate(m21, glm::vec3(2.*Arm1.armScale, 0., 0.)); 


m21 = glm::rotate(m21, rot2, zaxis); 
m21 = glm::translate(m21, glm::vec3(0., 0., 2.)); 


glm::mat4 m32 = glm::mat4( 1. ); // identity 


ІПТТІКІ 
Il z-offset from previous arm 


m32 = glm::translate(m32, glm::vec3(2.*Arm2.armScale, 0., 0.)); 


m32 = glm::rotate(m32, rot3, zaxis); 
m32 = glm::translate(m32, glm::vec3(0., 0., 2.)); 


Arm1.armMatrix = m1g; 
Arm2.armMatrix = m1g * m21; 
Arm3.armMatrix = m1g * m21 * m32; 


ІІ m1g 
ІІ m2g 
ІІ m3g 


ІПТТІКІ 
Il z-offset from previous arm 


mjb — July 24, 2020 


In the RenderScene Function 234 


VkBuffer buffers[1] = { MyVertexDataBuffer.buffer }; 
vkCmaBindVertexBuffers( CommandBuffers[nextlmagelndex], 0, 1, buffers, offsets ); 
vkCmdPushConstants( CommandBuffers[nextlmagelndex], GraphicsPipelineLayout, 


VK SHADER STAGE ALL, 0, sizeof(struct arm), (void *)&Arm1 ); 
vkCmdDraw( CommandBuffers[nextlmagelndex], vertexCount, instariceCount, firstVertex, firstInstance ); 


vkCmdPushConstants( CommandBuffers[nextlmagelndex], GraphicsPipelineLayout, 
VK SHADER STAGE ALL, 0, sizeof(struct arm), (void/*)&Armz2 ); 
vkCmdDraw( CommandBuffers[nextlmagelndex], vertexCount, instanceCount, firstVertex, firstlnstance ); 


vkCmdPushConstants( CommandBuffers[nextImagelndex], GraphicgPipelineLayout, 
VK SHADER STAGE ALL, 0, sizeof(struct arm), (Void *)&Arm3 ); 
vkCmdDraw( CommandBuffers[nextlmagelndex], vertexCount, insfaneeCount, firstVertex, firstInstance ); 


The strategy is to draw each link using the same 
vertex buffer, but modified with a unique color, 
length, and matrix transformation 
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In the Vertex Shader 235 


layout( push_constant ) uniform arm 


тай armMatrix; 

vec3 armColor; 

float armScale; // scale factor in x 
) RobotArm; 


layout( location = 0 ) in vec3 aVertex; 


vec3 bVertex = aVertex; Il arm coordinate system is [-1., 1.] in X 
bVertex.x += 1.; II now is [0., 2.) 
bVertex.x /= 2.; II now is [O., 1.) 
bVertex.x *= (RobotArm.armScale ); // now is [0., RobotArm.armScale] 


bVertex = vec3( RobotArm.armMatrix * vec4( bVertex, 1. ) ); 


gl Position = РУМ“ vec4( bVertex, 1. ); Il Projection * Viewing * Modeling matrices 


H ru 
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Vulkan: a More Typical (and Simplified) Block Diagram dd 


ymmand Biuffar 
ommand Buffer 
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Querying the Number of Physical Devices oe 


uint32_t count; 
result = vkEnumeratePhysicalDevices( Instance, OUT &count, OUT (VkPhysicalDevice *)nullptr ); 


VkPhysicalDevice * physicalDevices = new VkPhysicalDevice[ count ]; 
result = vkEnumeratePhysicalDevices( Instance, OUT &count, OUT physicalDevices ); 


This way of querying information is a recurring OpenCL and Vulkan pattern (get used to it): 


How many total Where to 
there are put them 
result = vkEnumeratePhysicalDevices( Instance, &count, пиріг ); 


result = vkEnumeratePhysicalDevices( Instance, &count, physicalDevices ); 
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Vulkan: Identifying the Physical Devices 239 


VkResult result = VK_SUCCESS; 


result = vkEnumeratePhysicalDevices( Instance, OUT &PhysicalDeviceCount, (VkPhysicalDevice *)nullptr ); 
= if( result != VK_SUCCESS || PhysicalDeviceCount <= 0 ) 
| Application | = 
{ 
fprintf( FpDebug, "Could not count the physical devices\n" ); 
return VK_SHOULD_EXIT; 


Physical } 
Device 


fprintf(FpDebug, "\n%d physical devices found.\n", PhysicalDeviceCount); 


VkPhysicalDevice * physicalDevices = new VkPhysicalDevice[ PhysicalDeviceCount ]; 
result = vkEnumeratePhysicalDevices( Instance, OUT &PhysicalDeviceCount, OUT physicalDevices ); 
if( result != VK SUCCESS ) 


fprintf( FpDebug, "Could not enumerate the %d physical devices\n", PhysicalDeviceCount ); 
return VK_SHOULD_EXIT; 
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Which Physical Device to Use, | 240 


int discreteSelect = -1; 

int integratedSelect = -1; 

for( unsigned int i = 0; i < PhysicalDeviceCount; i++ ) 

{ 
VkPhysicalDeviceProperties vpdp; 
vkGetPhysicalDeviceProperties( IN physicalDevices[i], OUT &vpdp ); 
ІК result != МК SUCCESS ) 


fprintf( FpDebug, "Could not get the physical device properties of device %d\n", 1); 
return VK_SHOULD_EXIT; 


} 


fprintf( FoDebug, " Device %2d:\n", і); 

fprintf( FpDebug, "\tAPI version: %d\n", vpdp.apiVersion ); 

fprintf( FpDebug, "Driver version: %d\n", vpdp.apiVersion ); 

fprintf( FpDebug, "\tVendor ID: 0x%04x\n", vpdp.vendorlD ); 

fprintf( FpDebug, "\tDevice ID: 0x%04x\n", vpdp.devicelD ); 

fprintf( FpDebug, "\tPhysical Device Type: %d =", vpdp.deviceType ); 

ІК vpdp.deviceType == VK_PHYSICAL_DEVICE_TYPE_DISCRETE_GPU ) fprintf( FpDebug, " (Discrete GPU)\n" ); 
ІК vpdp.deviceType == VK PHYSICAL DEVICE TYPE INTEGRATED GPU ) fprintf( FpDebug, " (Integrated GPU)\n" ); 
і vpdp.deviceType == VK PHYSICAL DEVICE TYPE VIRTUAL ОРУ) fprintf( FpDebug, " (Virtual GPU)\n" ); 
if( vpdp.deviceType == VK PHYSICAL DEVICE TYPE CPU) fprintf( FpDebug, " (CPU)\n" ); 

fprintf( FoDebug, "\tDevice Name: %s\n", vpdp.deviceName ); 

рип FpDebug, "\tPipeline Cache Size: %d\n", vpdp.pipelineCacheUUID[0] ); 
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Which Physical Device to Use, Il 241 


Il need some logical here to decide which physical device to select: 


if( vpdp.deviceType == VK PHYSICAL DEVICE TYPE DISCRETE GPU ) 
discreteSelect = i; 


if( vpdp.deviceType == VK PHYSICAL DEVICE TYPE INTEGRATED GPU) 
integratedSelect - i; 


} 


int which = -1; 
if( discreteSelect >= 0 ) 


which = discreteSelect; 
PhysicalDevice = physicalDevices[which]; 


else ІК integratedSelect >= 0) 
{ 

which = integratedSelect; 

PhysicalDevice = physicalDevices[which]; 
} 


else 


fprintf( FoDebug, "Could not select a Physical Device\n" ); 
return V&K SHOULD EXIT; 
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Asking About the Physical Device’s Features 242 


VkPhysicalDeviceProperties PhysicalDeviceFeatures; 


vkGetPhysicalDeviceFeatures( IN PhysicalDevice, OUT &PhysicalDeviceFeatures ); 


fprintf( FpDebug, ^nPhysical Device Features:\n"); 

fprintf( FpDebug, "geometryShader = %2d\n", PhysicalDeviceFeatures.geometryShader); 

fprintf( FpDebug, "tessellationShader = %2d\n", PhysicalDeviceFeatures.tessellationShader ); 

fprintf( FpDebug, "multiDrawIndirect = %2d\n", PhysicalDeviceFeatures.multiDrawIndirect ); 

fprintf( FpDebug, "wideLines = %2d\n", PhysicalDeviceFeatures.wideLines ); 

fprintf( FpDebug, "largePoints = %2d\n", PhysicalDeviceFeatures.largePoints ); 

fprintf( FpDebug, "multiViewport = %2d\n", PhysicalDeviceFeatures.multiViewport ); 

fprintf( FpDebug, "occlusionQueryPrecise = %2d\n", PhysicalDeviceFeatures.occlusionQueryPrecise ); 
fprintf( FpDebug, "pipelineStatisticsQuery = %2d\n", PhysicalDeviceFeatures.pipelineStatisticsQuery ); 
fprintf( FpDebug, "shaderFloat64 = %2d\n", PhysicalDeviceFeatures.shaderFloat64 ); 

fprintf( FpDebug, "shaderlnt64 = %2d\n", PhysicalDeviceFeatures.shaderlnt64 ); 

fprintf( FpDebug, "shaderlnt16 = %2d\n", PhysicalDeviceFeatures.shaderlnt16 ); 
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Here’s What the NVIDIA RTX 2080 Ti Produced 243 


vkEnumeratePhysicalDevices: 


Device 0: 
API version: 4198499 
Driver version: 4198499 
Vendor ID: 0x10de 
Device ID: 0x1e04 
Physical Device Type: 2 = (Discrete GPU) 
Device Name: RTX 2080 Ti 
Pipeline Cache Size: 206 
Device #0 selected (‘RTX 2080 ТІ”) 


Physical Device Features: 
geometryShader = 1 
tessellationShader = 1 
multiDrawlndirect = 1 
wideLines = 1 

largePoints = 1 
multiViewport = 1 
occlusionQueryPrecise = 1 
pipelineStatisticsQuery = 1 
shaderFloat64 = 1 
shaderlnt64 = 1 
shaderlnt16 = 1 
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Here’s What the Intel HD Graphics 520 Produced 244 


vkEnumeratePhysicalDevices: 


Device 0: 
API version: 4194360 
Driver version: 4194360 
Vendor ID: 0x8086 
Device ID: 0x1916 
Physical Device Type: 1 = (Integrated GPU) 
Device Name: Intel(R) HD Graphics 520 
Pipeline Cache Size: 213 
Device #0 selected ('Intel(R) HD Graphics 520") 


Physical Device Features: 
geometryShader = 1 
tessellationShader = 1 
multiDrawIndirect = 1 
wideLines = 1 

largePoints = 1 
multiViewport = 1 
occlusionQueryPrecise = 1 
pipelineStatisticsQuery = 1 
shaderFloat64 = 1 
shaderlnt64 = 1 
shaderlnt16 = 1 
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Asking About the Physical Device’s Different Memories 245 


VkPhysicalDeviceMemoryProperties vpdmp; 
vkGetPhysicalDeviceMemoryProperties( PhysicalDevice, OUT &vpdmp ); 


fprintf( FpDebug, ^n9ed Memory Types:\n", vpdmp.memoryTypeCount ); 
for( unsigned int i = 0; i < vpdmp.memoryTypeCount; i++ ) 


{ 
VkMemoryType vmt = vpdmp.memoryTypesfi]; 
fprintf( FpDebug, "Memory %2а: ", i); 
ІК ( vmt.propertyFlags & VK_ MEMORY _ PROPERTY DEVICE LOCAL BIT  )!-0) fprintf( FpDebug, " DeviceLocal" ); 
ІК ( vmt.propertyFlags & МК MEMORY PROPERTY HOST VISIBLE BIT )!=0) fprintf( FpDebug, " HostVisible" ); 
ІК ( vmt.propertyFlags & VK MEMORY PROPERTY HOST COHERENT BIT )!-0) fprintf( FpDebug, " HostCoherent" ); 
if( ( vmt.propertyFlags & VK MEMORY PROPERTY HOST CACHED BIT )!=0) fprintf( FpDebug, " HostCached" ); 
ІК ( vmt.propertyFlags & МК MEMORY PROPERTY LAZILY ALLOCATED BIT )!= 0)  fprintf( FpDebug, " LazilyAllocated" ); 
fprintf(FpDebug, ^n"); 
} 


fprintf( FpDebug, ^n9ed Memory Heaps:\n", vpdmp.memoryHeapCount ); 
for( unsigned int i = 0; i < vpdmp.memoryHeapCount; i++ ) 


{ 
fprintf(FpDebug, "Heap %d: ", i); 
VkMemoryHeap vmh = vpdmp.memoryHeaps[i]; 
fprintf( FoDebug, " size = 0х%081х", (unsigned long int)vmh.size ); 
ІК ( vmh.flags & VK MEMORY HEAP DEVICE LOCAL BIT )!-0)  fprintf( FoDebug, " DeviceLocal" ); //only опе in use 
fprintf(FpDebug, ^n"); 
} 
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Here’s What | Got 246 


11 Memory Types: 

Memory 0: 

Memory 7: 

Memory 2 

Memory 3 

Memory 4 

Memory 5: 

Memory 6: 

Memory 7: DeviceLocal 

Memory 8: DeviceLocal 

Memory 9: HostVisible HostCoherent 
Memory 10: HostVisible HostCoherent HostCached 
2 Memory Heaps: 

Heap 0: size = 0xb7c00000 DeviceLocal 
Heap 1: size = 0xfac00000 
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uint32 t count = -1; 
vkGetPhysicalDeviceQueueFamilyProperties( IN PhysicalDevice, &count, OUT (VkQueueFamilyProperties *)nullptr ); 
fprintf( FpDebug, ^nFound %d Queue Families:\n", count ); 


VkQueueFamilyProperties *vqfp = new VkQueueFamilyProperties[ count |; 
vkGetPhysicalDeviceQueueFamilyProperties( IN PhysicalDevice, &count, OUT vafp ); 
for( unsigned int i = 0; i < count; i++ ) 


{ 
рип FpDebug, "\t%d: queueCount = %2d ; ", i, vqfp[i].queueCount ); 
ІК ( vqfp[i].queueFlags & VK_QUEUE_GRAPHICS ВІТ ) !=0 ) fprintf( FpDebug, " Graphics" ); 
ІК ( vqfp[i].queueFlags & VK_QUEUE_COMPUTE ВІТ )!=0) fprintf( FpDebug, " Compute " ); 
ІК ( vqfp[i].queueFlags & VK_QUEUE_TRANSFER BIT ) !=0 ) fprintf( FpDebug, " Transfer" ); 
fprintf(FpDebug, "\п"); 

} 


THINK 
H BEYOND mjb — July 24, 2020 


Here's What I Got 248 


Found 3 Queue Families: 
0: queueCount = 16 ; Graphics Compute Transfer 
1: queueCount= 2; Transfer 
2:queueCount- 8 ; Compute 
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Vulkan: a More Typical (and Simplified) Block Diagram dn 


ymmand Biuffar 
ommand Buffer 
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Looking to See What Device Layers are Available 251 


const char * myDeviceLayers[ ] = 


{ 
I| VK LAYER LUNARG api dump", 
I| VK LAYER LUNARG core validation", 
I| VK LAYER LUNARG image", 
"VK LAYER LUNARG object tracker", 
"VK LAYER LUNARG parameter validation", 
Il VK LAYER NV optimus" 
E 
const char * myDeviceExtensions[ ] = 
{ 
"VK KHR surface", 
"VK KHR win32 surface", 
"VK EXT debug report" 
Il VK KHR swapchains" 
Б 


Il see what device layers are available: 


uint32_t layerCount; 
vkEnumerateDeviceLayerProperties(PhysicalDevice, &layerCount, (VkLayerProperties *)nullptr); 


VkLayerProperties * deviceLayers = new VkLayerProperties[layerCount]; 


result = vkEnumerateDeviceLayerProperties( PhysicalDevice, &layerCount, deviceLayers); 
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Looking to See What Device Extensions are Available 252 


// see what device extensions are available: 


uint32_t extensionCount; 
vkEnumerateDeviceExtensionProperties(PhysicalDevice, deviceLayers[i].layerName, 
&extensionCount, (VkExtensionProperties *)nullptr); 


VkExtensionProperties * deviceExtensions = new VkExtensionProperties[extensionCount]; 


result = vkEnumerateDeviceExtensionProperties(PhysicalDevice, deviceLayers[i].layerName, 
&extensionCount, deviceExtensions); 
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What Device Layers and Extensions are Available 253 


4 physical device layers enumerated: 


0x00401063 1 'VK LAYER NV optimus' 'NVIDIA Optimus layer 
0 device extensions enumerated for VK LAYER NV optimus": 


0x00401072 1 “К LAYER LUNARG core validation' 'LunarG Validation Layer 
2 device extensions enumerated for VK LAYER LUNARG core validation": 
0x00000001 'VK EXT validation cache' 
0x00000004 'VK EXT debug marker 


0x00401072 1 “К LAYER LUNARQG object tracker' 'LunarG Validation Layer 
2 device extensions enumerated for VK LAYER LUNARG object tracker: 
0x00000001 'VK EXT validation cache' 
0x00000004 'VK EXT debug marker 


0x00401072 1 “К LAYER LUNARG parameter validation' 'LunarG Validation Layer 
2 device extensions enumerated for VK LAYER LUNARG parameter validation': 
0x00000001 'VK EXT validation cache' 
0x00000004 'VK EXT debug marker 
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Vulkan: Creating a Logical Device 254 


float queuePriorities[1] = 


{ 
1. - 
Physical 
Y Device 


VkDeviceQueueCreatelnfo 
vdqci.sType = VK STRU RE TYPE DEVICE QUEUE CREATE INFO; 
vdqci.pNext = nullptr; 
vdqci.flags = 0; 
vdqci.queueFamiltyIndex = Ñ; 
vdqci.queueCount = 1; 
vdqci.pQueueProperties = quguePriorities; 


VkDeviceCreatelnfdC. vdci; ә 


vdci.sType = VK ‘STRUCTURE TYPE DEVICE CREATE INFO; 
vdci.pNext = nullptr 


Logical 
Device 


vdci.flags = 0; 
vdci.queueCreatelnfoCoWnt = 1; Il # of device queues 
vdci.pQueueCreatelnfos = NN vdqci; Il array of VkDeviceQueueCreatelnfo's 


vdci.enabledLayerCount = si2eof(myDeviceLayers) / sizeof(char *); 
vdci.enabledLayerCount = 0; 

vdci.ppEnabledLayerNames = myNeviceLayers; 

vdci.enabledExtensionCount = 0; 

vdci.ppEnabledExtensionNames = (const char **)nullptr; Il no extensons 
vdci.enabledExtensionCount = sizeof(myDeviceExtensions) / sizeof(char *); 
vdci.ppEnabledExtensionNames = myDevic&Extensions; 

vdci.pEnabledFeatures = IN &PhysicalDevicePeatures; 


result = vkCreateLogicalDevice( PhysicalDevice, IN &vdci, PALLOCATOR, OUT &LogicalDevice ); 
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Vulkan: Creating the Logical Device’s Queue 255 


Il get the queue for this logical device: 


vkGetDeviceQueue( LogicalDevice, 0,0, OUT &Queue ); Il 0, 0 = queueFamilyIndex, queuelndex 
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